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MEASURING RECESSIONS* 


Grorrrey H. Moore 
National Bureau of Economic Research 


1, INTRODUCTION AND SUMMARY 


HE purpose of this report is to provide a set of measurements of past 
business cycle recessions with which any current recession can be compared. 
The contractions in business activity that the American economy has ex- 
perienced from time to time have, of course, varied widely in severity. Yet 
even severe depressions have often begun gradually. How soon can a severe 
decline be detected? How do the relative declines in the various available 
measures of economic activity compare with one another as a contraction de- 
velops? How can one determine whether a contraction that is currently under 
way is already or is going to be smaller or larger than those that have occurred 
in the past? How can one judge when it is about to end? The measurements 
presented here suggest possible ways of providing answers to questions such as 
these. Although the body of the report deals with measurements for recessions 
that have already run their full course, we shall, at the end, show how the 
method has worked out from month to month during the recession that began 
in 1957. 
First, let us glance at the historical record of twenty-four business contrac- 
tions given in Tables 260 and 261. The peak dates are the months when ex- 





* Research along the lines developed in this report began during the recession of 1953-54, when tables compar- 
ing the percentage changes in a long list of economic series during the current and preceding recessions were prepared 
for Arthur F. Burns, then Chairman of the Council of Economic Advisers. In October 1957 R. J. Saulnier, present 
Chairman of the Council, requested the assistance of the National Bureau in preparing a similar set of tables. Tables 
covering some seventy monthly and quarterly economic series were promptly prepared. The Council compensated 
the National Bureau for the costs of this part of the statistical work. The electronic computer program was devel- 
oped with the aid of a grant from the National Science Foundation. The International Business Machines Corpora- 
tion generously contributed machine time on the 704 computer. These resources were essential to the pursuit of the 
study. No less essential were the intellectual and financial resources that have over the years been invested in the 
National Bureau’s studies in business cycles. The present report is, in the truest sense, a product of these contribu- 
tions, for the data and methods used here are virtually all derived in one way or another from this earlier work. 

The statistical tables were prepared by Sophie Sakowits, Dorothy O’Brien, and Sandra Renaud. Charlotte 
Boschan prepared and tested the electronic computer program. Alexander Pitts developed the materials necessary 
for the selection of the 1957 business cycle peak date and reviewed all the other peak dates. The charts were drawn 
by H. Irving Forman. Mary Phelps edited the manuscript. I am greatly indebted to these individuals as well as to 
others of the National Bureau staff for their wholehearted efforts to complete the job as speedily as possible. The 
comments of many who read the manuscript aided in its revision. These included Leo Grebler, Philip Klein, Maurice 
W. Lee, Ruth P. Mack, Roland I. Robinson, W. Allen Wallis, Donald B. Woodward, and Victor Zarnowitz. I am 
especially obliged to Arthur F. Burns, Solomon Fabricant, and Julius Shiskin for their sound advice and stimulating 
comments at all stages of the work. 

The paper has been approved for publication as a report of the National Bureau of Economic Research by the 
director of research and the Board of Directors of the National Bureau, in accordance with the resolution of the 
beard governing National Bureau reports (see the Annual Report of the National Bureau of Economic Research). 
Tt is to be reprinted as No. 61 in the National Bureau's series of Occasional Papers. 
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TABLE 260 


THE DURATION OF BUSINESS CYCLE EXPANSIONS AND CONTRACTIONS 
IN THE UNITED STATES, 1854-1957 


























Duration of 
Business Cycle 
Expansion Contraction 
Trough Peak Trough (months) 
Dec. 1854 June 1857 Dec. 1858 30 18 
Dec. 1858 Oct. 1860 June 1861 22 8 
June 1861 Apr. 1865 Dec. 1867 46 32 
Dec. 1867 June 1869 Dec. 1870 18 18 
Dec. 1870 Oct. 1873 Mar. 1879 34 65 
Mar. 1879 Mar. 1882 May 1885 36 38 
May 1885 Mar. 1887 Apr. 1888 22 13 
Apr. 1888 July 1890 May 1891 27 10 
May 1891 Jan. 1893 June 1894 20 17 
June 1894 Dec. 1895 June 1897 18 18 
June 1897 June 1899 Dec. 1900 24 18 
Dec. 1900 Sep. 1902 Aug. 1904 21 23 
Aug. 1904 May 1907 June 1908 33 13 
June 1908 Jan. 1910 Jan. 1912 19 24 
Jan. 1912 Jan. 1913 Dec. 1914 12 23 
Dec. 1914 Aug. 1918 Mar. 1919* 44 7 
Mar. 1919* Jan. 1920 July 1921* 10 18 
July 1921* May 1923 July 1924 22 14 
July 1924 Oct. 1926 Nov. 1927* 27 13 
Nov. 1927* Aug. 1929* Mar. 1933 21 43 
Mar. 1933 May 1937 June 1938* 50 13 
June 1938* Feb. 1945 Oct. 1945 80 8 
Oct. 1945 Nov. 1948 Oct. 1949 37 11 
Oct. 1949 July 1953 Aug. 1954 45 13 
Aug. 1954 July 1957 35 
Average, 24 cycles, 1854-1954 29.9 $ 19.9 








For an explanation of the method used to determine the business cycle peak and trough dates and some tests 
of their validity, see Arthur F. Burns and Wesley C. Mitchell, Measuring Busi Cycles (National Bureau of 
Economic Research, 1946), Ch. 4. A few of these dates (designated by an asterisk) have been revised since the 
Burns-Mitchell report, and the list has been carried forward to date. 

* Revised. 





pansion of aggregate economic activity culminated and contraction began, as 
judged from a variety of statistical records; the trough dates specify when con- 
traction culminated and expansion began. The measures of duration show that 
five of the eight contractions since 1920 have lasted roughly a year (11 to 14 
months). One was somewhat shorter (8 months), one somewhat longer (18 
months), and one very much longer (43 months). Before 1920, contractions 
frequently lasted more than a year—indeed, ten out of sixteen between 1857 
and 1919 lasted 18 months or more. The reasons for the apparent reduction in 
the typical length of contraction are not fully known, and we do not know 
whether it can be counted on as a permanent shift.! The intervening intervals 





1 For some observations suggesting that the durations of busi tracti (and expansions) may be subject 
to long swings associated with the construction cycle and related developments, see the report by Moses Abramovits 
in the National Bureau's Thirty-eighth Annual Report (May 1958). 
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of expansion have typically been substantially longer than the contractions, 
many of them lasting two or three years. This has been just as true since 1920 
as it was before. 

The durations of the expansions and the contractions are not sufficiently 
uniform or regular to give one more than a very rough notion about how long 
an expansion or contraction might be expected to last when it has just begun. 
After a year or so has elapsed, however, it may be of some help to know how 
frequently or infrequently phases of given lengths occur. Thus at the end of 
1956 one could say this about the expansion that had begun in August 1954: 

“If the current expansion were to continue through all of 1957, it will have 
lasted forty months. In the National Bureau’s business cycle chronology cover- 
ing the past 100 years there are only five expansions (out of twenty-four) that 
lasted as long as forty months: June 1861—April 1865, forty-six months; De- 
cember 1914—August 1918, forty-four; March 1933—May 1937, fifty; June 1938— 
February 1945, eighty; and October 1949-July 1953, forty-five. Four of these 
expansions encompassed major wars, and one was the recovery from the Great 
Depression. Clearly, if the present expansion extends through 1957 without a 
setback it will establish a new precedent.”? This bit of information in itself, of 
course, was not enough to forecast a recession, but it could usefully be con- 
sidered together with other more direct and more important evidence. It now 
appears that the expansion came to an end in July 1957, that is, after 35 months 
(see below). 

Table 261 shows the size of the declines between the business cycle peak and 
trough dates since 1920, as registered by several widely used measures of busi- 
ness activity. Clearly, a contraction that appears more severe than another by 
one measure may appear less severe by another measure. The 1953-54 contrac- 
tion was somewhat greater than 1948-49 when measured in terms of the per- 
centage decline in industrial production or the increase in the unemployment 
rate, but somewhat less than 1948-49 when measured by the percentage decline 
in gross national product or in nonagricultural employment.’ Nevertheless, one 
can construct at least a rough ranking of the contractions according to severity. 

At the top of the list of recent contractions, obviously, is the contraction that 
began in 1929—the longest and deepest. The only other that comes close to it 
in the National Bureau’s 100-year chronology is the contraction of 1873 to 
1879. Next most severe among those since 1920 are the contractions of 1920-21 
and 1937-38. Both were very sharp and fairly short, but that of 1937-38 began 
when unemployment was still at a very high level, much higher than in 1920. 
The contraction of 1923-24 was of moderate amplitude, not unlike that of 
1948-49. The most recent contraction, 1953-54, was in most respects of slightly 
lesser magnitude than the contraction of 1948-49, yet greater than that of 
1926-27, and certainly more widely recognized. There remains the brief con- 
traction after World War II, February—October 1945, which marked the tran- 
sition from a wartime to a peacetime economy, and which is the most difficult 





? Thirty-seventh Annual Report, National Bureau of Economic Research (May 1957), p. 53. 

+ Note that the table measures the percentage declines between business cycle peak and trough dates. Some- 
what different results would be obtained if the declines were measured from the specific peak in each indicator to its 
trough. The latter method has some advantages for the purpose of measuring the amplitude of business cycles, and 
some disadvantages. In practice we have used both methods, 
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of all to characterize because different measures yield such different results. 
However, in terms of its impact upon the well-being of the population it must 
surely be classed among the more modest of those in our list.‘ 

In order to have a definite scale we shall use the following ranking of con- 
tractions according to severity, excluding the 1945 episode because of its special 
character. The ranking is based partly on the information in Table 261 (see 
second section of table) and partly on other information bearing on the depth 
of these contractions (see note 3 above). 








Contraction Rank 





Oct. 1926 Nov. 1927 1 (mildest) 
July 1953 Aug. 1954 2 
Nov. 1948 Oct. 1949 3 
May 1923 July 1924 4 
Jan. 1920 July 1921 5 
May 1937 June 1938 6 

7 


Aug. 1929 Mar. 1933 (most severe) 








We can then construct, for each of these contractions, measures that show 
by how much business activity declined from the peak as the contraction con- 
tinued, and compare such measures with the above ranking. Such measures 
should tell us at about what stage—that is, how many months after the con- 
traction began—the relative severity of each recession became evident, and 
how it manifested itself in different aspects of economic activity, such as pro- 
duction, employment, incomes, prices. Similar measures constructed during the 
course of a current contraction can then be used to appraise its severity and its 
scope compared with earlier contractions. 

One of the prerequisites for such an analysis is that the current contraction is 
known or is believed to have begun, so that the date from which it starts, i.e., 
the peak of the business cycle, can be fixed. Of course, such a date may be 
selected tentatively, when a contraction is only suspected. If the assumption 
turns out to be an error, the error need not long persist. Experience suggests 
that the date of the peak can be determined with reasonable accuracy fairly 
soon after it occurs.’ 

Study of materials developed along these lines and presented later in the re- 
port suggests the following tentative conclusions: 





4 Another period that might be considered a *yclical contraction is 1951-52. Many sectors of the ny suf- 
fered setbacks at this time, after the rapid upsurge in 1950 when the Kcrean War began. Nevertheless, although the 
rate of growth of aggregate economic activity slackened perceptibly there was no appreciable over-all decline in 
output, income, or employment, and no rise in unemployment, since th 2 defense industries kept expanding rapidly. 
Therefore we do not consider it to be a business cycle contraction. For an analytic description of this period see 
Bert G. Hickman, The Korean War and United States Economic Activity, 1950-1952 (National Bureau of Economic 
Research, Occasional Paper 49, 1955). For the definition of business cycles followed in our work, and a discussion of 
its historical application, see Arthur F. Burns and Wesley C. Mitchell, Measuring Business Cycles (National Bureau 
of Economic Research, 1946), Chapters 1 and 4. 

5 For a description of some methods for accomplishing this see Arthur F. Burns, Frontiers of Economic Knowl- 
edge (National Bureau of Economic Research, 1954), especially pp. 107-34, 179-80, and Geoffrey H. Moore, 
Statistical Indicators of Cyelical Revivals and R ions (National Bureau of Economie Research, Occasional Paper 
31, 1950). 
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1. When a business recession begins, most broad indicators of aggregate eco- 
nomic activity (production, employment, income, trade) show relatively slight 
declines, and during the first six months of the recession the magnitude of the 
declines bears little relation to the ultimate severity or depth of the recession. 

2. About six months after a recession begins, the percentage declines from 
the peak month to the current month in most economic aggregates are smaller 
in mild recessions than in severe recessions, and this ranking is maintained in 
succeeding months with little change. 

3. When such comparisons are made for types of economic data that typi- 
cally begin declining before a recession starts (for example, new orders, con- 
struction contracts, the average workweek, stock prices) the distinction be- 
tween mild and severe recessions begins to appear as early as three or four 
months after the recession begins, and is also substantially maintained in suc- 
ceeding months. 

4. Although frequently both mild and sharp business contractions have 
ended within about a year, the recovery to the previous peak level has been 
accomplished much more quickly after mild contractions. Hence the period of 
depressed activity has been much longer when the contraction proceeded at a 
rapid rate. 

5. While the above conclusions suggest that a rough ordering of recessions 
according to severity can be made within four to six months after the onset, 
they do not imply that either the ultimate depth or the duration of recessions 
can be reliably forecast by this means. Many factors not taken into account by 
the method, such as governmental measures taken to combat depression, have 
an important bearing on the severity and duration of business contractions. The 
method appears useful primarily in providing a yardstick against which a cur- 
rent decline in various aspects of economic activity can be gauged, and thereby 
facilitating a more accurate and enlightened appraisal of what has already 
taken place. This in itself might facilitate the development of appropriate 
counter-cyclical programs. __ 

6. Measures of the strength of various counter-cyclical factors (for example, 
unemployment compensation payments, increased governmental expenditures, 
easier credit terms, lower taxes) at similar stages of recession might be devel- 
oped on the same plan as described here, although it is not attempted in this 
study. Such measures might be of assistance in judging the prospects for further 
business contraction or for a resumption of economic expansion. 

7. Several months before a recession comes to an end and an upturn in 
aggregate activity occurs, a progressive narrowing of the scope of contraction 
ordinarily becomes visible. Fewer activities continue to decline, more begin to 
rise. It appears first in series of the “leading” type. The more extensive and 
more sustained this reduction in the scope of the contraction is, the more 
likely that it marks the real end of recession rather than an abortive recovery. 
Information of this sort may help to identify an upturn in aggregate activity 
at about the time it occurs or shortly thereafter. 

8. When the methods developed in this investigation are applied to the 
business contraction that began in July 1957, we find that: 


a. After eight months of contraction, i.e., through March 1958, most indicators have 
declined more than in the corresponding periods of the four milder contractions 
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since 1920 (1923-24, 1926-27, 1948-49, 1953-54) and less than in the three more 
severe contractions (1920-21, 1929-30, 1937--38). 

. The intermediate position of the 1957-58 contraction first became apparent in 
data for the leading indicators for November 1957, i.e., four months after the 
peak of July 1957. It was confirmed by most indicators of aggregate economic 
activity when data for February 1958 became available. 

. In contractions of the severity indicated for the 1957-58 contraction, it would be 
in line with previous experience if the level of economic activity generally remained 
below the previous peak level (July 1957) for a period ranging from a year rand a 
half to two and a half years. 

. One of the outstanding features of the first eight months of the 1957-58 contraction 
has been the relatively modest decline in personal income. The rise in consumers’ 
prices has been less unusual, since increases occurred during the first eight months 
of four of the seven business contractions since 1920. 


9. The tentative findings reported above need to be tested further. The 
method could usefully be tested on declines that did not reach business cycle 
proportions. Comparisons based on a different method of dating downturns— 
e.g., dating the downturn from the peak in the specific series being compared— 
should be made, and other ways of measuring the severity of recessions should 
be explored. The empirical results should be examined in the light of the hy- 
potheses that have been advanced to account for variations in the severity of 
business cycle contractions. Work along these lines will be facilitated now be- 
cause electronic computer programs are available to handle the computations. 


2. CHANGES IN AGGREGATE ECONOMIC ACTIVITY DURING 
THE FIRST YEAR OF RECESSION 


Table 266 shows how a comparison of developments during the first year of 
mild and severe business contractions works out for one widely used economic 
indicator, the Federal Reserve index of industrial production. Percentage 
changes are computed from the peak standing (a three-month average that in- 
cludes the business cycle peak month, the month preceding and the month fol- 
lowing) to one month after the peak, two months after the peak, and so on up 
to twelve months after the peak. The table covers the seven business cycle con- 
tractions since 1920 (excepting the contraction that followed World War II). 
Note that the peak dates are not necessarily those at which the production 
index reached its peak, but rather when business activity at large did so. 
Usually the peak in the production index has not differed by more than a month 
or two from the business cycle peak.’ Use of the business cycle peak enables us 
to examine a wide variety of series on a comparable basis (see below). 





* Two important contributions have already been made. Julius Shiskin has constructed and analyzed an ex- 
tensive set of measures of the scope, magnitude and rate of change in the separate industry components of various 
economic aggregates such as employment, production, and new orders. He has compared the current contraction 
with those beginning in 1953, 1948, 1937, and 1929 on a plan similar to that used here, and also on a plan that uses 
the “specific cycle peak” dates in each aggregate as the point from which to start the comparison. This work has 
been carried on at the Bureau of the Census for the Council of Economic Advisers. 

Pao Lun Cheng, Michigan State University, in a paper on “Statistical Indicators and Cyclical Amplitudes,” 
presented at the annual meeting of the Midwest Economics Association, Des Moines, Iowa, on April 19, 1958, 
explores the relations between the severity of business cycle contractions and the rates of change in indicators prior 
to and during the contractions. Part of this work is along lines very similar to those followed here, and yields similar 
conclusions. In addition, however, Dr. Cheng tests a number of interesting hypotheses that go well beyond our own 
work. 

7 At one cf the peaks the difference was 5 months; at one, 2 months; at three, 1 month; and at two there was 
no difference. In the current recession the difference is somewhat greater than usual. The peak in the production 
index, according to revised figures published in March 1958, was reached in December 1956 or February 1957 (146 
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TABLE 266 


PERCENTAGE CHANGES IN INDUSTRIAL PRODUCTION DURING 
SEVEN BUSINESS CYCLE CONTRACTIONS 








Business Cycle Peak 
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July | Nov. May Jan. May | Aug. 


Months 1953 1948 1923 1920 1937 1929 
after 





Peak Standing at Peak (1947-49: 100)* 


103.7 | 49.0 | 42.7 | 64.7 | 60.7 








Percentage Change from Peak 
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® Three-month average, centered on the peak month. 

> A rank of 1 is given to the smallest decline (or largest rise) among the seven contractions, a rank of 7 to the 
largest decline, etc. 

Source: Board of Governors of the Federal Reserve System. Index is adjusted for seasonal variations. The 
seven busi cycle tracti are arrayed from left to right according to their over-all severity (see text). 





Table 26€ makes it clear that changes in the volume of industrial output in 
the first month or two of a business recession are typically slight. Declines of 
only 2 or 3 per cent are the rule. Moreover, the relative severity of the decline 
manifests itself only irregularly during the first six months or so after the con- 





in both months), seven or five months before the business cycle peak in July 1957. Before revision, the index showed 
December 1956 one point higher than February. The revised index declines to 144 in April and May, rises to 145 in 
June, July and August, and then declines to date. 
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traction begins. The mild recessions of 1948 and 1953 began with a relatively 
sharp decline in industrial production, although after five months the declines 
were not as great as in the major contractions of 1929 and 1937. It is important 
to realize that in the first few months of what turns out to be a mild recession 
the decline in output may be as sharp or sharper than in a severe contraction. 
Furthermore, a severe contraction like that of 1920-21 can start out with only 
a moderate decline in producticn. The ultimate severity of the 1929 contraction 
in comparison with the 1921 or the 1937 contraction was not evident in terms 
of the magnitude of the decline in industrial output even after twelve months 
had elapsed. Nevertheless, these measurements do permit at least a rough 
classification of contractions according to severity after about six months, and 
the validity of the classification improves as the span increases. It can be made 
more dependable by reference to other data, as we shall see. 

In reading Table 266 and similar tables it is important to recall that all eco- 
nomic series are subject to erratic movements due to such factors as strikes, 
international incidents, unusual weather, flu epidemics, and the like. These 
have less influence on broad aggregates, such as the index of production, than 
on data for narrow sectors, as a rule. Nevertheless, each monthly figure should 
be scrutinized in relation to the evidence provided by adjacent months, as well 
as other information. A graphic record, like that provided in Chart 268, will 
help. In some cases, too, the absolute level of the figures should be considered, 
as well as the change from the peak level. That is why, for example, the unem- 
ployment rate is shown in Chart 268 as an absolute rate, rather than simply in 
terms of the change in the rate. 

Another factor that should be taken into account is the possible error in 
our dates for the business cycle peaks. For example, recently we revised the 1929 
date, shifting it from June to August. July is a close competitor with August 
in this choice, but both appear definitely superior to June in terms of the data 
now available. The peak in January 1920 may be too early by a month or two. 
A similar difficulty exists in the choice between July and August 1957 (see text 
below). The use of a three-month average centered on the peak month tends to 
reduce the effect of such uncertainties on our measures of changes. But their 
more important effect is on the number of months a decline has been under 
way by a given date. If the peak is dated two months early, the fifth month 
after the erroneous peak is actually only the third month after the true peak. 
The relative effects of such errors, however, tend to diminish as the interval 
from the peak lengthens (see note 14, below). 

Tables similar to Table 266 have been compiled for a number of other im- 
portant aggregative measures of economic activity, and are reproduced in 
Appendix A. Here we shall consider only the rankings of the percentage changes 
(Table 273). Since these series figure significantly in our determination of busi- 
ness cycle turning dates, usually their peaks and that of the business cycle 
closely coincide. At any given turn, however, some may continue to rise for a 
few months, as the table indicates. On the other hand, some may have begun 
to decline before the designated peak month, so that in these cases the ranking 
is based on only that part of the decline that took place after the business 
cycle peak. 
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* All series except the unemployment rate are converted to indexs, using the three-month average centered on 
the business cycle peak months or quarters as the base (= 100). The peak months (quarters) are: August (3Q) 1929, 
May (2Q) 1937, November (4Q) 1948, July (2Q) 1953, and July (3Q) 1957. All series except industrial stock prices 
and wholesale prices are adjusted for seasonal variations. 
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Cuart 268 (continued) 
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Cuart 268 (continued) 
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On the whole, Table 273 confirms the showing of Table 266. The initial 
changes in measures of the general level of economic activity may be quite mis- 
leading if regarded as an indication of the severity or mildness of the impending 
decline. It is not until some six months after the peak that the ultimate severity 
of the contraction, in relative terms, is reflected even moderately well in most 
of these series.* 

It appears that some series are less reliable at a given interval after the peak 
than others (Table 274). Railroad freight carloadings, retail sales, and the 
wholesale price index (excluding farm products and foods) show relatively low 
correlations. In the case of carloadings, the long-run decline in the relative 
share of the railroads in freight traffic due to the competition of other carriers 
has tended to increase the severity of the recent declines. For example, in 1920— 
21 the decline in carloadings was only a third as large as the decline in industrial 
production during the first eight months; in 1929-30 during the same period 
the drop in carloadings was two-thirds as great as in industrial production; in 
1953-54 the decline in carloadings was one and a half times that in production; 
in 1957-58, carloadings fell 18 per cent during the first eight months, which 
again is about one and a half times the drop in industrial production. 

The failure of retail sales to correlate well with the severity of business con- 
tractions may be due to lack of comparability of the data for earlier cycles. The 
figures for 1929-30 and earlier recessions are limited to department stores, 
whereas the later figures cover all types of retail store. However, department 
store experience was probably more nearly representative of total retail sales 
in the twenties than it is today. In the case of the wholesale price index it ap- 
pears that initial declines have been sharper in some of the mild business con- 
tractions than in the more severe. Until ten months after the business cycle 
peak the correlation is inverse, though small. 

Some of the erratic factors that may affect results based on a single indicator 
can be ironed out by averaging several indicators. The six indicators that show 
the most consistently high correlations in Table 274 (nonagricultural employ- 
ment, gross national product, industrial production, bank debits outside New 
York City, personal income, and corporate profits) taken together provide the 
rankings shown in Table 275. The average ranks are computed only for 3, 6, 
9, and 12 months after the peak because two of the series are available only 
quarterly. 

3. CHANGES IN “LEADING INDICATORS” DURING THE 
FIRST YEAR OF RECESSION 


The series used in Table 273 are precisely those in terms of which the ulti- 
mate severity of a business recession is likely to be judged.* In attempting an 





8 The .05 significance level for a Spearman rank correlation coefficient based on seven observations, as most of 
the coefficients in Table 273 are, is .71 (Sidney Siegel, Nonparametric Statistics for the Behavioral Sciences, 1956, 
p. 284). In appraising the coefficients in Tables 273 through 276, however, one should keep in mind that the co- 
efficients for a given series for different monthly spans are not statistically independent, that the coefficients for 
different series for the same span are not independent, and that the .05 significance level for coefficients based on the 
average ranks of groups of series is certainly less than .71, though not as much less as it would be if the series were 
independent. The coefficients are presented mainly to provide a convenient summary statistic on the degree of 
relationship between the changes in the indicators and the severity of the contractions. 

* The number of unemployed or the unemployment rate should of course be considered in any such appraisal. 
They are omitted from Table 273 because the available thly data for tracti before World War II are not 





comparable in magnitude with current data. The unemployment rate is included in Table 278, where only the 
directions of change are utilized. 
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TABLE 275 


RANKING OF SEVEN BUSINESS CYCLE CONTRACTIONS ACCORDING 
TO SEVERITY IN SUCCESSIVE MONTHS AFTER PEAKS, 
TWO GROUPS OF INDICATORS 




































































Rank of Average Rank 
Contraction beginning Rank Corre- 
a lation with 
~ = Oct. July Nov. May Jan. May Aug. | Severity of 
1926 1953 1948 1923 1920 1937 1929 | Contraction 
(1) (2) (3) (4) (5) (6) (7) 
Six Indicators of Aggregate Economic Activity* 
3 2 3 7 5 4 1 6 .18 
6 1 4 5 3 2 6 7 .68 
9 2 3 4 1 5.5 7 5.5 .72 
12 1 2 4 3 7 6 5 -82 
Ten Leading Indicators? 
1 1 3.5 6 3.5 5 7 2 .32 
2 1 2 7 3 6 5 4 .50 
3 1 2 5 3.5 3.5 6 7 .88 
4 1 2 3 1 6 5 7 .96 
5 1 2 3 4 5 7 6 .96 
6 1 3 4 2 5 7 6 .86 
7 1 3 4 2 5 7 6 .86 
8 1 2.5 4 2.5 6 7 5 .83 
9 1 2.5 4 2.5 7 6 5 .79 
10 2.5 2.5 1 4 7 6 5 .74 
11 1 3 2 4 7 6 5 .83 
12 3 1 2 4 7 5.5 5.5 As 
Souree: Tables 273 and 276 and Appendix A. The coven busi contracti are arrayed from left to right 
according to their over-all severity (see text). A rank of 1 indicates a small decline, a rank of 7 a large decline. 





* Nonagricultural employment, gross national product, industrial production, bank debits, personal income, 
corporate profits (see text). 
> See Table 276. 


early estimate of relative severity, however, other types of series may be more 
helpful. In the National Bureau’s studies of business cycles numerous series 
have been uncovered that typically turn down some months before the peak in 
general business. Because of their sensitivity, they may provide an early indi- 
cation of the severity of contractions. Table 276 contains the rankings of the 
percentage changes for a group of them, based on the figures provided in Ap- 
pendix A. 

We find that even as early as the third month after the peak the declines in 
many of the leading series begin to differentiate the more severe from the less 
severe recessions. Hence something can be gained by watching the early movers. 
Again, some appear less reliable for the purpose than others. The layoff and 
accession rates, business failure liabilities, and new orders for durable goods 
show relatively low or erratic correlation coefficients (Table 274). Because of 
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the erratic month-to-month changes that many of these series exhibit, it is 
especially desirable to study all the evidence rather than rely on data for a 
single month or a single series. The average ranks for all ten leading indicators 
provide one means for doing this (Table 275), though of course the individual 
series and their relations to one another and to other series should be studied 
as well. As a group, the leaders provide a more reliable indication of severity 
than the aggregates during the first six or possibly nine months of recession.'° 


4. CHANGES IN THE SCOPE OF BUSINESS CYCLE CONTRACTIONS 


The measures provided in Appendix A can help, also, to show how wide- 
spread the economic contraction is and whether it is spreading further. Table 
278 shows how this may be done, using ten indicators of aggregate economic 
activity and ten leading indicators. The left-hand section of the table indicates, 
for successive months after the business cycle peak, what proportion of the se- 
ries were above their level at the cycle peak (that is, a three-month average 
centered on the peak date). Very few of the aggregates (section A) reached such 
a favorable position during the first twelve months of the several recessions, 
although in the milder recessions one or two of the aggregates could be counted 
in this group in nearly every month. In 1953-54 eighteen months elapsed be- 
fore half the aggregates regained the levels they had attained at the business 
cycle peak, and in 1948-49 it took seventeen months to reach a similar position. 
In the more severe contractions none of the aggregates got back to the peak 
level in that length of time. Among leading indicators (section B), five of the 
ten were above the July 1953 peak levels within ten months, or three months 
before the business upturn in August 1954. In the 1948-49 contraction five of 
the ten leaders had exceeded the peak level after thirteen months, or two 
months after the business upturn began. In 1920, 1937, and 1929 none had re- 
gained the peak level even after a full year of contraction. In Chart 290 the solid 
curves depict these developments during the first 24 months of each contrac- 
tion. 

Even though an indicator has not risen sufficiently to exceed the level at- 
tained when business activity was at its zenith, it may nevertheless have begun 
to rise. It is vital to watch for these initial upward movements. The right-hand 
section of Table 278 (and the dotted curves in Chart 290) shows what propor- 





1° This implies that the leaders indicate the subsequent ranking of the aggregates. The following set of rank 
correlation coefficients, based on the ranks of the average ranks in Table 275, supports that inference, as well as the 
observation that the ranks of the leaders stabilize at an earlier date: 























Months after Peak 
3 6 9 

Leaders in specified month vs. 

Aggregates 3 mos. later .88 .95 -99 

Aggregates 6 mos. later -70 .86 a 

Leaders 3 mos. later .85 .88 .76 
Aggregates in specified month vs. 

Aggregates 3 mos. later -29 .59 .83 

Aggregates 6 mos. later —.14 .39 _ 
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tion of the indicators began to register recovery in that sense as the contractions 
proceeded. It compares the current monthly figure with the figure three months 
earlier." We find that in the 1953-54 and 1948-49 episodes substantial propor- 
tions of the ten aggregates were rising by the tenth month after the business 
recessions began (i.e., several months before the end of the contractions). 
But in 1937-38 and 1929-30 none of the aggregates were rising at this stage, 
even though in 1937-38 the end of the contraction came in the thirteenth 
month (June 1938). The recoveries in the leading indicators took place earlier 
and were more widespread. Even in 1937-38, half of the leaders had registered 
increases from the three-month-ago level by the tenth month, March 1938, 
thus heralding the end of that recession. 

A sustained narrowing of the scope of the contraction is a signal that it is 
about to end. For example, the 1937-38, 1948-49, and 1953-54 contractions 
ended about a year after they began. The evidences of sustained recovery be- 
came apparent at an earlier date in the milder 1948-49 and 1953-54 episodes 
than in the sharp 1937-38 contraction. In each case, however, the narrowing of 
the scope of the contraction occurred earlier in the leaders than in the aggre- 
gates. At times, as in 1923-24, a narrowing of the scope of contraction may be 
followed by a further widening before recovery finally gets under way.” 

The measures provided in Table 278, especially those showing the proportion 
of indicators that have exceeded their levels at the peak, suggest that if a 
period of depressed activity is defined as the interval from the peak to the date 
when activity regains the peak level, this period may be more closely related 
to the severity or magnitude of the decline from peak to trough than the dura- 
tion of the decline itself is. Of course, different indicators of aggregate economic 
activity regain their previous peak levels at different dates. If we take as one 
criterion the date when half of our ten indicators of aggregate activity regained 
their level of the preceding business cycle peak, and as an alternative estimate 
the date when the industrial production index did so, we obtain the results 
shown in Table 281. 

Not only are the periods of depressed activity substantially longer, as would 
be expected, than the contraction durations; they are also more closely related 
to the severity of the contractions. It took approximately a year and a half for 
the economy to recover, in the sense defined above, from the four mild or 
moderate contractions of 1926, 1953, 1948, and 1923; two and a half to three 
years to recover from the sharp contractions of 1920 and 1937; and at least 
seven years to recover from the 1929 contraction. 

Another way to look at these results is as follows. Except in the Great 
Depression about a year or year and a half elapsed before the economy gen- 
erated an upturn in business activity, whether the contraction was severe or 
moderate. But after the upturn the recovery to the previous peak was attained 





The choice of this interval involved a reconciliation of two conflicting considerations. A shorter interval 
would reflect a cyclical upturn more promptly, but would also reflect erratic movements more frequently. A longer 
interval would have the opposite advantage and disadvantage. The interval could well be different for different 
series. (A criterion for selecting an appropriate interval is developed by Julius Shiskin in Electronic Computers and 
Business Indicators, National Bureau of Economic Research, Occasional Paper 57, 1957.) 

For an analysis of the timing of changes in the scope of business cycle expansi and contracti and the 
relation of scope to severity, see the references cited in note 5 and also Geoffrey H. Moore, “The Diffusion of Business 
Cycles,” in Economics and the Public Interest, Robert A. Solo, editor (Rutgers University Press, 1955), pp. 35-64. 
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in a much shorter time after the moderate contractions than after the severe 
ones. These recoveries were accomplished in 7, 5, 6, and 4 months after the 
upturn in the 1926, 1953, 1948 and 1923 contractions, respectively (col. 6 
minus col. 3, Table 281). Recovery took roughly three times as long, 17 and 18 
months, after the upturns in the 1920 and 1937 contractions, and nearly four 
years in the Great Depression. Although recoveries from severe contractions 
have usually taken place at a more rapid rate than from moderate contractions 
(that is, the percentage rate of growth per month is higher), the difference has 


TABLE 281 


DURATIONS OF BUSINESS CONTRACTIONS AND PERIODS OF 
DEPRESSED ACTIVITY 














Date when Peak Level Period of Depressed Percentage 
Was Regained* Activity Decline in 
Business Business Duration of } Industrial 
Cycle Cycle Contraction, Estimate | Estimate | Production 
Peak Trough Monthsfrom| Estimate Estimate A B from Busi- 
Date Date (1) to (2) A B [months {months ness Cycle 
from (1) | from (1) Peak to 
to (4)] to (5)] Trough 
(1) (2) (3) (4) (5) (6) (7) (8) 
Oct. 1926 Nov. 1927 13 June 1928 | June 1928° 20 20 — 5.7 
July 1953 Aug. 1954 13 Jan. 1955 May 1955 18 22 — 9.5 
Nov. 1948 Oct. 1949 ll Apr. 1950 Apr. 1950 17 17 -— 7.7 
May 1923 July 1924 14 Nov. 19244 July 1925 18 26 —16.3 
Jan. 1920 July 1921 18 Dee. 1922 Oct. 1922 35 33 —29.0 
May 1937 June 1938 13 Dec. 1939 Oct. 1939 31 29 —31.5 
Aug. 1929 Mar. 1933 43 bd Nov. 1936 ® 87 —50.1 


























* Estimate A is the date when at least half of the following indicators of aggregate economic activity had re- 
gained their respective levels at the business cycle peak date: nonagricultural employment, unemployment rate, 
gross national product, industria! production, freight carloadings, bank debits, personal income, retail sales, cor- 
porate profits, wholesale price index. Not all indicators were available for every date. Estimate B is the date when 
the industrial production index regained the level it had reached at the business cycle peak (three-month average 
centered on the peak). 

> Half the indicators were above their peak levels temporarily in March and August 1927, or five and ten 
months after the peak, respectively, but before the trough was reached. 

© The production index attained the peak level temporarily in March 1927, five months after the peak but before 
the trough was reached. 

4 Half the indicators were above their peak levels temporarily in February 1924, nine months after the peak 
but before the trough was reached. 

® Industrial production was the only one of the ten indicators that exceeded its level as of the August 1929 peak 
before the next business cycle peak of May 1937. 


not been sufficient to compensate fully for the greater depth of the severe con- 
tractions. Hence the recoveries take longer.” 

All this suggests that our measures of the percentage declines in various indi- 
cators in the early stages of contraction may indicate, though only in a very 
rough way, the duration of “depression” as well as the severity of contraction. 
No great precision can be expected, however. The course of a contraction and 
the subsequent recovery is not foreordained. Both can be and have been in- 
fluenced by deliberate actions taken to shorten the contraction and hasten 





3 This implies a greater degree of uniformity in rates of cyclical expansion than in rates of contraction. For some 
observations on this point see my introduction to Daniel Creamer’s Personal Income during Business Cycles (Na- 
tional Bureau of Economic Research, 1956), pp. xxiii-xxviii, and the National Bureau's Thirty-seventh Annual 
neport, May 1957, p. 52. 
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recovery, as well as by events and policies that originate elsewhere. Historical 
patterns and relationships ought not to be transplanted mechanically. They 
can and should be used to help us formulate realistic appraisals of existing 
situations. 


5. OTHER APPLICATIONS 


So far, attention has been on the use of the measures provided in this report 
to indicate the severity of recessions and to judge the prospects of recovery. 
Another use may be mentioned briefly. By applying the measures to a wide 
variety of economic data one can determine some of the distinctive character- 
istics of each recession—what sectors or aspects of economic activity are strong 
and what are weak. For example, in both the 1953-54 and 1948-49 recessions 
residential building displayed great strength; in 1937-38 and 1929-30, notable 
weakness. These differences are sharply etched in the percentage changes in 
the volume of contracts (Appendix A). Six months after recession began, resi- 
dential contract volume (seasonally adjusted) had dropped 40 per cent in 1929 
and 21 per cent in 1937, but had risen 3 per cent in 1948-49 and 10 per cent in 
1953-54. Consumer instalment credit advanced vigorously in 1948—49 (it was 
28 per cent higher a year after the recession began), rose moderately in 1953- 
54 (4 per cent higher), but declined appreciably in 1929-30 (9 per cent lower), 
and 1937-38 (6 per cent lower). Such differentiation of the strong and weak sec- 
tors in the economy during a recession is essential to appropriate diagnosis 
and prescription of policies to encourage revival. 

For this purpose, too, the type of measure presented here may well be ex- 
tended to other data of strategic interest from a policy standpoint. Data on 
comparative changes in personal and corporate income tax payments, in un- 
employment benefits, in federal and in state and local expenditures, in public 
works contracts, in interest rates, in the money supply, and in Federal Reserve 
operations would enable one to appraise the strength and timing of either de- 
liberate or “built-in” stabilization policies. The simple technique illustrated in 
this report can thus be adapted to provide an up-to-date, objective set of facts 
on which to judge not only the severity, scope, and unique character of a de- 
veloping recession, but also the prospects for an early recovery and the vigor 
with which steps are being taken to bring recovery about. 

In order to facilitate the application of this analysis an electronic computer 
program has been prepared for the IBM 704 which computes the percentage 
changes from peak month to the first, second, third and up to the twenty-fourth 
month after peak, for any given series and for any given list of peak dates, The 
program also makes a similar set of computations of percentage changes from 
trough months, so that it can be used to compare cyclical revivals as well as 
recessions. In addition, the total percentage change from peak to trough and 
trough to peak is computed, to provide a measure of the full amplitude of 
cyclical swings. Appendix A is, in fact, a photocopy of the print-out provided 
by the program (total peak to trough changes are omitted for lack of space). 


6. 1957-1958 RECESSION 


How does the current contraction in business activity compare with earlier 
contractions when measured by the methods described above? As already noted, 
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such a comparison requires that the date of the peak from which the contraction 
began be established, at least on a tentative basis. Accordingly, July 1957 was 
selected as the monthly business cycle peak date, and the third quarter of 1957 
as the quarterly peak date. This determination was made in October 1957, when 
data through September were available for the more important economic series. 

The selection of a peak date was difficult because a moderate decline in out- 
put and employment in manufacturing had been under way since late in 1956, 
while activity in most other sectors continued to advance. Indeed, some im- 
portant factors, such as total personal income, had scarcely begun to decline 
by September 1957. Because of the continued rise in prices and wages, physical 
volume series on output, income, and trade reached peaks earlier than the 
corresponding value series. In general, during the first half of 1957 the physical 
volume of labor input (manhours) and output of the economy at large remained 
nearly constant or at best gently rising, while greater increases occurred in the 
pecuniary volume of output, trade, and income. Although declines in activity 
became widespread between August and September and have been extended 
since, it is difficult to say whether July or August should be considered the 
zenith. Subsequent revisions of the data may shift the weight of evidence to 
August, or, less likely, to an earlier month. It must be observed, too, that at 
the time the peak date was selected, the contraction had not yet become suf- 
ficiently pronounced, or lasted long enough, to qualify as a business cycle con- 
traction in the National Bureau’s chronology. The judgment that it was likely 
to become so has, of course, since been validated by events. 

Once the peak date was determined, we could construct the tables in Ap- 
pendix A that show the extent of the decline after the July 1957 peak for many 
economic series, and compare this decline with what occurred during similar 
intervals of time in previous business contractions. The first comparisons were 
with the two relatively moderate postwar contractions, 1948-49 and 1953-54, 
and the two severe prewar contractions that began in 1929 and 1937. Later, 
with the aid of the computational program developed for the IBM 704 elec- 
tronic computer, we extended the comparisons to earlier cycles and made the 
computations in several variant forms. 

Appendix A presents the record of what these comparisons have shown about 
the current recession as it developed, and how it stood at the time this report 
went to press. Drawing upon that record, Table 284 compares the percentage 
changes for the first seven months of the current recession, i.e., July 1957 to 
February 1958, with those for the corresponding periods of the preceding re- 
cessions. The summary columns at the right show how many of the indicators 
experienced a smaller decline in this recession than in the earlier ones, and how 
many a larger decline. Both the indicators of aggregate activity and the leading 
indicators are decisive in recording a substantially smaller decline in 1957-58 
than in the severe contractions of 1937-38 and 1929-30. In comparison with 
the severe contraction of 1920-21 the leading indicators also show a substan- 
tially smaller decline, while most of the aggregates show a larger decline cur- 
rently. Historically, the leaders have been a more reliable guide at this stage 
(seven months after the peak), and hence are entitled to greater weight. Thus, 
our evidence points to a less severe contraction on this occasion than those ex- 
perienced in 1920-21, 1929-30, or 1937-38. 
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On the other hand, nearly all the leading series, and a majority of the ag- 
gregate indicators, registered larger declines in the seven months since July 
1957 than in the corresponding periods of each of the four milder business con- 
tractions, namely, 1926-27, 1953-54, 1948-49, and 1923-24. The 1957-58 con- 
traction appears, therefore, to be of intermediate proportions, not the mildest 
on record nor yet the most severe (see Chart 287). The rankings shown in the 
bottom section of the table confirm this indication." 

This evaluation of the severity of the 1957-58 contraction began emerging 
from the leading series when data became available for the fourth month after 
the July peak, i.e. for November 1957 (Tables 288 and 289). As we have seen in 
the preceding pages, a historical 1nking of the recessions based on changes in 
the leading series during the first four months accords fairly well with their 
ultimate ranking according to severity. Hence when it became apparent (in 
late December) from data covering the four-month span July—-November 1957 
that declines in most of the leading series were larger than in the milder re- 
cessions in our list but smaller than in the most severe recessions, this bit of 
evidence helped to support other indications pointing to a recession of this gen- 
eral character. Data for five-, six-, seven and eight-month spans have provided 
further support. 

On the other hand, the historical analysis of the indicators of aggregate eco- 
nomic activity suggested that a ranking of the recessions even moderately 
consistent with their ultimate severity would not emerge until at least six 
months had elapsed. Thus a comparison of the declines in these aggregates in 
the current recession with their declines in earlier recessions was not likely to 
yield consistent results during the first six months. This was indeed the case. 
Changes in the aggregates over a six-month span showed that the current de- 
cline was substantially smaller than in the severe contractions of 1929 and 1937, 
and slightly smaller than in the mild contractions of 1926 and 1953. But most 
of them also showed larger declines than in the contractions of 1923 and 1920, 
which rank fourth and fifth respectively in our list according to severity. This 
inconsistency is not likely to be erased until data for the aggregative indicators 
for the first nine months (i.e., through April 1958) are available. If the leading 
series turn out to be a reliable guide in this respect, the aggregates should then 
show larger declines than in the 1926-27, 1953-54, 1948-49, and 1923-24 con- 
tractions but smaller declines than in the 1920-21, 1937-38, and 1929-30 con- 
tractions. 

The business contraction that began in mid-1957 rapidly engulfed most of 
our indicators, as its predecessors had done also. Table 278 shows that by 
October, the third month after the peak, eight of the ten aggregates and eight 
of the ten leading series had moved down from their levels at the peak. The 





4 If August 1957 were selected as the busi cycle peak instead of July, the current contraction would appear 
somewhat more severe relative to the earlier contractions, because the six-month decline from August to February 
in many of the indicators of aggregate activity would be nearly the same as the seven-month decline from July to 
February (since the three-month averages centered on July aud August would be nearly alike), but the decline would 
be compared with six-month declines in the earlier contractions instead of with seven-month declines, The average 
ranks for the aggregate indicators in Table 284 would be only slightly altered, however, and those for the leading 
series scarcely changed at all. Thus the shift would not affect the conclusions given above. 
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Cuart 287. Comparative changes during first eight months of 
business cycle contractions, selected indicators. 
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Percentage changes are computed from the three-month average of each indicator centered on the business 
cycle peak. The eight-month intervals are: Jan. 1920-Sept. 1920, May 1923-Jan. 1924, Oct. 1926-June 1927, 
Aug. 1929-Apr. 1930, May 1937-Jan. 1938, Nov. 1948-July 1949, July 1953—Mar. 1954, July 1957—Mar. 1958. 


wide scope of the decline was maintained in succeeding months. More recently, 
a slight narrowing of the scope of the contraction has appeared, especially in 
the leading series, when the measurement is made in terms of changes over a 
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three-month span (see Chart 290).% The improvement has not yet carried as 
far as the similar improvement had at the end of the first six months of 1953- 
54 contraction. At that time (January 1954) four of the ten leading series had 
already moved above their level at the peak, whereas none had achieved this 
position by January 1958. Moreover, as is evident from the chart, reversals in 
these measures are not infrequent. Nevertheless, if the modest improvement 
that has occurred is sustained and extended, it may signal the beginning of the 
end of the 1957—58 contraction. 


TABLE 289 

RANKING OF 1957-1958 CONTRACTION IN RELATION TO SEVEN EARLIER 
BUSINESS CYCLE CONTRACTIONS, FOR SUCCESSIVE MONTHS 

AFTER PEAKS, TWO GROUPS OF INDICATORS 
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Source: Tables 275 and 284, and Appendix A. The seven business contractions before 1957-1958 are arrayed 
from left to right according to their over-all severity (see text). A rank of 1 indicates a small decline, a rank of 8 a 
large decline. 

* Nonagricultural employment, gross national product, industrial production, bank debits, personal income, 
corporate profits. Not all series are available for all cycles—see Table 284. 

> See Table 275. Correlation coefficients are based on contractions prior to 1957-58. 

© See Table 276. 


It is well, however, to recall that in business contractions of the dimension 
that the 1957—58 contraction has so far exhibited, the interval from the business 
cycle peak to the date when economic activity in general regained the peak 
level has been at least a year and a half. If the contraction turns out to be more 
severe than any of the four mild or moderate contractions since 1920, as is 
presently suggested by the leading indicators, this interval may be exceeded. 





8 Julius Shiskin pointed out (in February 1958) a similar development in several diffusion indexes compiled 
at the Bureau of the Census for the Council of Economic Advisers, and it has appeared in some other diffusion 
indexes compiled at the National Bureau of Economic Research. These indexes show, for example, that there has 
been a slight increase in the number of manufacturing industries reporting an advance in the workweek since 
autumn 1957, and a slight increase in the number reporting a rise in new orders. 
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On the other hand, these measures do not suggest that the period of less than 
peak activity will extend to the two and a half years that characterized the 
severe contractions of 1920-21 and 1937-38. In any event, a great deal depends 
on the governmental and private actions undertaken to bring about an early 
and rapid recovery. 

Some of the distinctive characteristics of the 1957-58 contraction, as it has 
developed so far, may be observed in Table 284. One of the outstanding facts is 
the relatively small decline in personal income. The decline of 1.3 per cent, 
July 1957 to February 1958, is only one-sixth as large as the 8.1 per cent de- 
clines during the first seven months of the 1937 and 1929 contractions. Yet the 
current decline in employment is more than half as large as in 1937 and 1929, 
and the current decline in gross national product is also a substantial fraction 
of the corresponding declines in 1937 and 1929. The greater stability of personal 
income is partly attributable to the larger role of unemployment compensation 
and other transfer payments, partly to the growth of employment in govern- 
ment, distributive trades, and other service industries where rates of pay and 
the volume of employment are typically more stable.'* In terms of disposable 
personal income (that is, income after federal income taxes), the contrast be- 
tween the current recession and those of the thirties would be even greater. 

Another unusual and perhaps not unrelated feature of the 1957-58 recession 
is the fact that the wholesale price index after eight months of recession is still 
above its level when the recession began. Except for 1920, this is the only re- 
cession in which this index, which covers all commodities except farm products 
and foods, has not declined, although in 1953-54 the decline during the first 
eight months was minute. In the case of the consumers price index, the rise 
during the current recession is less exceptional, as the following figures indicate: 








| 


Percentage Change during 





Business | First Eight Months First Twelve Months 








Cycle |- — Seo . 
Contraction, Basic | Wholesale | Con- Basic Wholesale Con- 
beginning | Commodity | Price Index, | sumers’ | Commodity | Price Index, | sumers’ 
Price | Excl. Farm Price Price Excl. Farm | Price 
Index | & Food Index Index & Food Index 


= o — — - | —_— 


July 1957 +0. | +2.6 





| 





Oct. 1926 
July 1953 
Nov. 1948 | 
May 1923 
Jan. 1920 
May 1937 
Aug. 1929 


io 6) 


—6. | +0.6 | +4. 
—0.: +0.: +4. 


o 


—5.9 
—0.2 
—5.1 
—7.0 
--18.6 
—5.3 


—8.6 





-—we 





| 
bo 


—5. —2.1 | —22 
—4, | +1.2 | -8. 
+9. +3.8 | —39 

| 8. -0.9 | 32. 

|} —4. | 2.7 |. =e. 

| 


| 
— oe 
— 


wNowonht & tO Nw 


a a 
ee ee 








16 Cf. Daniel Creamer, Personal Income during Business Cycles (National Bureau of Economic Research, 1956). 
Another factor is the secular decline in the proportion of total income derived from farming, which has in the past 
declined more sharply than total nonfarm income during business recessions. Indeed, in the current recession farm 
income has been increasing rather than declining. The increasing number of wage contracts that tie rates of pay to 
the cost of living, which has risen in the first eight months of this recession, may have imparted some stability to 
incomes currently, although this hinges on the question whether the resulting increases in wage rates may not have 
brought about some reduction in hours or in employment. 
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On the other hand, the price index of basic commodities (cotton, wool, copper, 
steel scrap, etc.) has declined during the first eight months of the current re- 
cession as it has in every other recession except 1953-54. 

In the more severe recessions, all three price indexes declined further in the 
first twelve months than they had in the first eight. But in the milder recessions 
an interesting gradation appears. The basic commodity price index showed a 
smaller decline or an actual increase after twelve months in the 1926-27, 1953- 
54, and 1948-49 recessions. The decline in the wholesale price index remained 
about the same. The consumers’ price index showed further weakness, so that 
the declines after twelve months were somewhat greater than after eight (or the 
increases were less), even in the mild recessions. Thus our figures reflect some 
of the lagging relations among prices, as well as the influence of mild or deep 
recessions upon the entire price structure. 

Other facets of the 1957-58 contraction are well worth study and reflection 
when set against the corresponding pattern of events in previous contractions. 
The “forward look” that these previous contractions provide is illuminating. 
We have been able to touch on ouly a few of the many strategic economic 
variables and relationships that can usefully be analyzed during the course of a 
business contraction, a fact that underlines the tentative and preliminary char- 
acter of the experiments reported above. 
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APPENDIX A 


ACTUAL IBM 704 PRINT-OUT SHOWING STANDINGS AND PERCENTAGE 
CHANGES AFTER BUSINESS CYCLE PEAKS, TWENTY 
ECONOMIC INDICATORS 


Nore: All series except Wholesale Prices ex Farm Products and Food, Basic Com- 
modity Prices, and Index of Industrial Stock Prices are adjusted for seasonal variations. 
The peak standings are three-month averages centered on the business cycle peak month. 
Quarterly data are shown as of the mid-month of the quarter, and peak standings are for 
the business cycle peak quarter. The units in which the standings are expressed are given 
at the end of the table, together with brief source notes. An asterisk at the peak standing 
indicates the beginning of a new segment of the series, identified in the notes. For sources 
of current data and current seasonal factors see Appendix B. 


NONAGRICULTURAL EMPLOYMENT 


STANDINGS FIRST YEAR AFTER PEAK 
AT PEAK +4MO0 +5 M0 *6MO0 #7MO0 + 8MO + 9 MO +11 MO 


3090,3 3004 2969 2948 2923 2910 2890 28631 

296747 2974 2948 2901 2838 2778 2757 2718 
41429_7* 40731 40447 40047 38056 38034 38425 39320 
4467843 43726 43554 43342 43216 43030 43107 42455 
49875.0 49344 49156 48859 48714 48506 48407 48274 
52810,7 52218 51980 $1709 51035 $0732 

STANOINGS SECOND YEAR AFTER PEAK 

AT PEAK #16 MO +17 MO +18 MO #19 MO +20 MO +21 MO 

3090.3 2685 2664 2647 2629 2625 2606 

2967 e7 2713 2729 2758 2784 2789 2795 

41429,7# 41298 41705 42195 42437 42588 428655 

4467843 43333 43707 44068 44558 44992 45589 

4987540 48600 48756 46827 48909 49232 49461 


5281007 


PER CENT CHANGES FIRST YEAR AFTER PEAK 
DATE AT PEaK +1M™0 +20 #3M0 #4 M0 +5 M0 +6M0 +7 MO + 8 MO + 9 MO 
1929 8 309003 -e27 -e75 -1250 -2079 -3493 ~4e61 ~5e41 5084 -6048 
1937 5 296767 e21 069 065 e21 ~066 -2025 4937 -6039 -7910 
1945 2 41429,7* -ell “078 ~1043 71069 -2037 3034 ~8e14 -8e20 ~7025 
1948 11 44678,3 27 -1206 -1e63 -2013 -2052 72099 -3027 3069 -3,52 
1953 7 49875,0 -e07 236 -048 1206 ~1e44 -2204 72033 —2074 -2694 


1957, 7 528107 206 -065 ~1lel2 -1¢57 -2009 3036 3094 


CENT CHANGES SECOND YEAR AFTER PEAK 
AT PEAK +13 MO +14 MO #16 MO +17 MO +18 MO +19 MO #20 MO +21 MO 
309043 -10053 -11e43 -13012 -13680 -14035 -14093 <-15006 1567 
2967,7 -1020 -9e96 -6.58 -8404 -72006 ~6019 -6002 5982 
41429, -3270 ~2042 ~032 266 1085 2043 2080 3044 
4467843 3057 -3075 -3201 -2017 -1e37 -e27 270 2004 


49875,.0 3946 -3036 ~2056 -2024 -2010 1094 -1029 -283 
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UNEMPLOYME!; RATE 
STANDINGS 
*2™”0 «30 +4 MO + 5 MO 
231 189 208 221 
1097 1114 1163 1219 
446 473 492 518 
290 308 354 451 
4s3 “70 493 $03 
STANDINGS 
#14 MO +15 MO #16 MO +17 MO 
901 1078 11869 1118 
1958 1918 1860 1822 
665 644 627 584 
612 573 535 499 


+14 MO 
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269.0 


34943 


ABSOLUTE CHANGES 
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500 4aae 
470 479 


FIRST YEAR AFTER PEAK 


+60 +7 M0 
25420 30500 
21763 43863 
22660 19360 
22163 26703 
15343 24363 
SECOND YEAR 
#18 MO +19 MO 
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AT PEAK 
8284.0 
928400 
1055640 
925200 
2212.0* 
2660.0 
3674.0 


440040 


AT PEAK 
8284.60 
928469 
1055600 
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AT PEAK 
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GROSS NATIONAL PRODUCT 


«3 MO 
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2237 
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4326 


+15 m0 
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PER CENT CHANGES 
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STANDINGS FIRST YEAR AFTER PEAK 
* 6 MO +9 0 
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STANDINGS SECOND YEAR AFTER PEAK 
+18 MO +21 MO 
6288 8900 
9908 9248 
6068 7948 
8856 
2152 2201 
2759 2944 
3677 3790 
FIRST YEAR AFTER PEAK 
+ 6 MO +9 m0 
-.97 2075 
1.08 047 
-5053 ~7,62 
-7082 11076 
4007 -10.89 
-2058 -2484 
-2069 -29523 
3041 
SECOND YEAR AFTER PEAK 
+18 MO +21 MO 
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6072 239 
23957 “24471 
~4e28 
-207) 250 
452 11,51 
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+12 MO 
8448 
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INDUSTRIAL PRODUCTION 


STANDINGS FIRST YEAR AFTER PEAK 

DATE AT PEAK #1M0 #2M0 ¢©36m0 ¢ 4™O +5 M0 +6MO0 #+7M0 ¢ 8 MC + 9MO +10 MO 411 MO +12 MO 
1920 } 42.7 44 43 41 42 42 41 41 40 39 35 33 31 
1923 $5 4920 49 48 48 47 46 46 45 46 4? 46 45 43 
1926 10 32,7 52 52 52 52 53 52 52 52 52 51 50 50 
1929 8 69.7 60 59 56 54 54 54 53 53 5) 50 48 46 
1937 $ 64,7 64 65 64 62 57 52 47? 46 45 45 “4 43 
1945 2 124,7 125 122 119 117 lll 99 89 86 89 67 65 81 
1948 11 1036? 102 101 9° 97 96 95 95 94 %6 97 94 96 
1953 7? 13663 136 133 132 wd 126 125 125 123 123 125 124 123 
1957 7? 145.0 145 144 142 139 135 133 130 128 

STANDINGS SECOND YEAR AFTER PEAK 

OATE AT PEAK +13 MO +414 MD) #15 MO +16 MO 417 MO +18 MO #19 MO +20 MO +21 MO +22 MO +23 MO +24 MO 
1920 1 427 31 30 $0 $1 30 30 31 31 35 33 33 34 
1923 § 494.0 41 40 42 as 44 45 47 48 4eé 48 48 48 
1926 10 S267 49 50 51 51 52 51 52 53 $3 54 55 56 
1929 8 6067 46 “5 44 43 42 42 43 43 43 42 41 40 
1937 § 64,7 44 46 49 .. 50 51 54 54 $5 54 54 53 53 
1945 2 124,7 69 87 84 90 91 94 95 96 97 96 $9 99 
1948 11 103.7 98 100 99 102 106 110 112 115 120 120 121 120 
1953 7 136,43 123 124 126 128 130 132 133 135 136 138 139 139 
1957 7? 145.0 

PER CENT CHANGES FIRST YEAR AFTER PEAK 


OaTe AT PEAK +1MO +2 m0 *3 M0 +4MO0 #360 +60 *+7M0 #¢8M0 +9 MO +10 MO 411 MO +412 MO 


1920 4} 42.7 3013 078 -3291 “1456 71256 -3091 -3091 ~6025 “8059 -17097 -22066 ~27636 
1923 5 49.0 200 ~2204 -2204 ~4208 -6012 ~6412 ~8e16 ~6012 =4208 ~6012 “8016 =-12024 
1926 10 5267 1027 1027 1027 ~le27 63 ~le27 ~1e27 =le2? -3416 “3016 5006 5206 
1929 ®8 60.7 1.10 2075 “7069 -10099 =10699 ~10.99 712064 -12064 -45,93 <-17458 “20088 ~-24018 
1937 5 64.7 ~1.03 052 ~1403 ~Sel2 =11686 =-19059 -27,32 ~28087 -30041 ~30061 -31496 -33e50 
1945 2 124,7 e27? 2014 ~4554 “6015S -10096 ~20059 -28,6) “31002 =28061 -30621 -31,82 -35003 
1948 11 103.7 ~leél ~2.57 ~4450 6043 7240 ~8036 8036 -9032 -7240 6043 ~9032 ~7240 
1953 7 13663 ~e24 ~2048 3018 -5.38 -7258 -8e31 ~8e31 -978 -9.78 ~8031 -9205 —9e78 
1957 7? 145.0 200 ~069 2007 ~S514 -6290 8028 -10034 ~11.72 
PER CENT CHANGES SECOND YEAR AFTER PEAK 


oaTEe AT PEAK 413 MO +16 MO 415 MO +16 MO +17 MO 418 MO +19 MO +420 MO #21 MO 422 MO #23 MO 424 MO 


1920 3 42,7 “27930 =29469 -29.69 -27,34 29069 =29969 -27534 27034 =22066 =22066 =-22566 -20031 
1923 5 49.0 16033) -18037 =14029 -10.20 ~-10.20 ~8016 ~4208 -2004 72004 8 =2004 ~2004 =2004 
1926 10 52.7 6296 =5206 -3.16 ~3016 “4.27 ~3016 -1e27 063 «63 2083 4043 6033 
1929 8. 60.7 24018 -25082 '-27647 =29512 “3007 -30077 =29612 ~29.12 “29012 =30077 =32042 ~34007 
1937 «65 64,7 “31096 -28.87 -24.23 -22.68 “21013-16049 -16.49 ~-14,95 “16049 =16649 =18504 ~184046 


1945 2 124,7 728061 ~30021 -32662 -27681 -27.00 24060 =-23680 -22.99 -22,.19 “22099 =20059 =20059 
1948 1) 103.7 “5047 3054 4250 1661 2025 6611 8404 10.93 15.76 15076 16072 15076 
1953 7 136.3 ~978 -9205 -7.58 ~6011 ~4065 -3.18 2044 “098 ~924 1022 1096 1496 
1957 7 145,0 
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FREIGHT CARS LOADED 


STANDINGS FIRST YEAR AFTER PEAK 

DATE AT PEAK + 2 MO + 2M0 #3M0 #¢4 MO +5 MO + 6M0 + 7M0 + 6MO +9 M0 +10 MO #11 MO +12 MO 
1920 1 87563 872 683 811 880 8768 650 889 856 874 658 830 793 
1923 5 97940 992 985 956 934 939 964 951 944 976 954 910 886 
1926 10 1036,0 1042 986 1037 1026 1040 1009 994 1001 978 984 785 966 
1929 8 103043 1023 1009 954 959 933 943 916 943 910 912 889 857 
1937 5 77720 753 756 743 723 680 628 $86 613 §73 $66 560 557 
1945 2 839.3 852 853 834 840 837 766 768 722 782 770 784 803 
1948 11 Blle? 794 773 760 704 778 752 687 684 678 634 527 659 
1953 7 756.0 754 734 717 692 649 676 668 647 636 661 638 658 
1957 7 69663 700 660 649 621 624 590 574 $77 

STANDINGS SECOND YEAR AFTER PEAK 
DATE AT PEAK +13 MO +14 MO 415 MO 416 MO +17 MO +18 MO #19 MO +20 MO +21 MO #22 MO #23 MO +24 MO 
192° (1 875,3 770 720 784 776 747 708 743 745 810 742 748 782 
1923 5 97920 888 881 889 934 957 976 999 979 974 965 971 961 
1926 10 103620 926 936 962 974 988 974 978 977 986 982 1007 1019 
1929 8 1030.3 840 833 798 800 767 762 766 777 740 733 738 686 
1937 5 777,40 552 574 579 585 613 617 632 622 607 613 385 603 
1945 2 83%—3 842 658 630 818 652 836 829 830 616 648 871 677 
1948 11 Blle? 682 650 629 725 726 717 761 748 790 794 795 788 
1953 7? 75600 633 636 649 665 685 680 695 700 703 746 716 754 
1957 7 696863 
PER CENT CHANGES FIRST YEAR AFTER PEAK 


DATE AT PEAK #1M0 +#2M0 #3M60 #4 M60 +9M0 +6M0 ©7M0 #860 +9 MO 410 MO 412 MO 412 MO 
1920 1 67563 -038 288 ~7035 53 230 -2089 1056 -20e21 ~el5 1098 5018 94) 
1923 5 979.0 1.33 261 -2035 —4460 -4409 1653 2086 -3.5? -e3l -2055 ~7605 -9050 
1926 10 1036.0 258 4.83 210 -e77? 239 -2061 4.05 -3038 ~5260 3202 4992 ~6076 
1929 6 1030.3 ~e7l -2207 ~Te4l 6092 ~9e45 6048 =-11210 -8048 11068 -11048 -13072 -16082 
1937 5 77700 -3409 -2e70 4038 —6095 -12048 =-19018 ~-24058 =21lell =-26025 -27018 <-27093 -28031 
1945 2 839.3 1e51 1663 -064 208 -028 ~8e74 8050 ~135098 -6e83 ~8026 6059 ~4033 
1948 11 811.7 2016 4676 6036 =-13626 ~4e15 ~7035 =-15036 =15073 =-16047 =-21089 35007 ~18081 
1953 7 756.0 ~026 -2091 =5016 ~Be47 -14015 -10058 -—Lle64 -14042 -19087 =<-12057 -150e61 -12096 
1957? 7 69643 224 5449 =7¢06 =-11007 -1006% -15651 =17080 =-17087 


PER CENT CHANGES SECOND YEAR AFTER PEAK 

DATE AT PEAK +413 MO +14 MO +15 MO 416 MO +17 MO +18 MO #19 MO #20 MO #21 MO +22 MO 423 MO +24 MO 
1920 1 875.43 12003 1707S =-10043 <-11035 -14066 -19012 -15012 14089 —7e46 -15023 -14055 -10266 
1923 5 97940 “9029 =-10001 -9019 4260 72025 ~e3} 2204 200 ~o5i -1043 -082 ~1e64 
1926 10 1036.0 10062 -9265 -7el4 -5298 ~4063 3298 5260 ~5069 ~4,63 ~5e21 -2280 ~1064 
1929 8 1030.3 “18947 -19e15 =-22055 -22035 25056 —2600% 25965 -24959 -28018 -28066 -28037 -33042 
1937 § 777.0 —286096 -26013 -25048 24071 —2lell =-20059 -18066 =-19095 <-21¢88 -2lell -24071 ~22039 
1945 2 83963 o32 21060 24094 -2054 1e51 “240 1023 ~lell -2078 1003 3077 4049 
19486 11 611.67 “15098 =-19092 =-22050 =10068 =-10055 -11066 6024 -7e84 ~2067 -2018 -2005 -2092 
1953 7 756.0 “16027 =15087 -14015 ~-12204 “9039 =-10005 -8.07 -Te4l -7201 ~1e32 -5029 -026 
1957 7 69863 
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SANK DEBITS OUTSIDE NEW YORK CITY 












STANDINGS FIRST YEAR AFTER PEAK 
DATE AT PEAK +10 +2M0 +3 MO +40 +5 M0 © 6 MO +7M0 + 6MO +90 410 MO 411 MO 432 MO 


1920 1 20107 199 203 205 199 203 208 203 206 197 199 186 173 
1923 5 193.0 193 182 186 180 187 189 185 187 197 185 191 188 
1926 10 22243 225 227 224 234 233 234 231 231 227 239 244 232 
1929 6 291,40 284 298 297 249 2465 242 243 241 246 239 229 227 
1937 5 198.0 196 198 196 199 193 188 182 173 166 166 166 163 
1945 2 445.3% 454 426 463 512 43 439 419 44) 463 a7s 465 472 
1948 11 66843 668 650 631 647 643 635 629 628 618 63? 637 629 
i9ss 7 93120 936 947 946 943 947 920 943 949 943 931 a7 946 
1957 7 325667 1285 1207 1195 1212 1243 1201 1197 
STANDINGS SECOND YEAR AFTER PEAK 

DATE AT PEAK +12 MO #14 MO +15 MO +16 MQ +17 MO +18 MO +19 MO 420 MO +21 MO 422 MO 423 MO +26 MO 
1920 1 20167 164 160 161 155 157 152 160 161 158 162 158 153 
1923 5 193.0 181 187 193 190 194 196 202 212 209 206 204 206 
1926 10 222.3 248 245 238 245 250 250 266 262 237 254 256 256 
1949 8 291.0 222 219 205 214 207 192 188 194 191 186 182 179 
1937 5 198.0 169 164 170 174 176 175 185 175 * = 169 175 170 182 
1945 2 44553" 480 491 496 510 517 $21 540 528 $41 $55 $51 $6? 
1948 11 668.2 650 654 658 670 676 706 725 730 772 768 765 TAs 
1983 7 95140 961 951 954 98) 1032 1008 1019 1025 1038 1085 1061 1071 


19587 7 125667 


PER CENT CHANGES FIRST YEAR APTER PEAK 

oate av Peak +160 © 260 #30 ¢©4MO0 © 5 M0 *©6M0 + 7MO0 ¢8MO0 © 9 M0 410 MO ¢11 MO +12 MO 
1920 2 20167 ~1032 066 1665 1032 066 3014 66 2015 ~2e31 ~1e32 ~Te7? -14021 
i923 5 193,0 000 = $5070) =— 3063) = 6 TH -3ell -2007 “4015 ~3ell 2407 ~4e15 12004 -2059 
1926 10 222.3 1420 2210 75 5.25 4.80 5025 3490 3290 2210 7250 9e74 435 
1929 8 291.0 2.41 2041 2006 =14043 =15081 =-16084 =-16049 <-17018 =15646 -17067 <-21031 ~21099 


1937 5 198.0 1201 200 -1e01 250 ~2052 5205 “8008 -12063 ~-16016 <-16016 -16016 ~17068 
1945 2 445,3* 1095 4079 3097 14697 -2054 ~les2 5091 “097 3697 6021 4042 5299 
1946 11 668.3 -205 ~2e74% 5059 =-3019 =3079 4099 5088 6203 ~7053 504s 4069 -5083 
1953 7 95100 «53 -042 -053 -084 ~e42 ~3026 084 ~e21 084 -2010 ~042 053 
1957 7 1256.7 2025 -3.95 -6691 3655 -1209 4043 4.75 

PER CENT CHANGES SECOND YEAR AFTER PEAK 
DATE AT PEAK 413 MO +14 MO #15 MO +16 MO +417 MO #18 MO 419 MO +20 MO #21 MO #22 MO #23 MO +24 MO 


1920 1 20167 "18068 =20066 =-20017 <-23014 <-22015 <24063 -20066 -20017 -21065 -i9067 -21065 ~-26013 
1923 5 193.0 ~6022 “Bell 000 = 1455 052 1655 4266 9056 6.29 bes 5070 6074 
1926 10 2223 11654 10019 7205 10019 12444 12044 19064 17484 6260 14024 1l4e24 15014 
, 1929 6 291.0 “23071 -2467% =-29055 =-26046 ~-28087 =-34002 -35040 -33033 -34036 =35040 -37046 -38049 
1937 5 196.0 ~14e65 =-17e17? =24014 <12012 <-1lell =11062 “6057 =-1le62 =14065 =11062 -14014 008 
1945 2 44553 7078 10025 11038 14652 16209 16099 2126 17089 21048 24063 23073 27032 
1948 11 668.3 ~2074 ~2014 —1055 225 1015 5264 8448 9023 15,51 14e91 14045 17031 
1953 7 951.20 12005 200 «31 315 6452 5299 715 7.73 9-15 10094 11097 12042 
1957 7 125647 








MEASURING RECESSIONS 299 


PERSONAL INCOME 


STANDINGS FIRST YEAR AFTER PEAK 


DATE ay Peak + 1 MO + 2 M0 +3M0 +40 +5 M0 + 6M0 +749 ¢©8M0 +9 MO 410 MO +31 MO 012 MO 
1923 5 1447.0 14686 1710 1728 1662 
1926 @8 1911.0 1867 18% 1918 1908 

fm! 1929 6 871_7* a7l 867 83e 833 821 812 801 soe 795 7A5 2 15) 
1937 5 75943 768 762 764 746 733 717 698 6A7 687 685 477 671 
1945 2 1735.7 1736 1722 1730 1751 1750 1707 1632 1668 1695 1682 1790 1693 
1948 11 2128.3 2121 2087 2082 2097 2086 2081 2042 2n38 2058 2071 2029 2980 
1953 7 2882,0 2876 2871 2874 2865 2863 2850 2853 2847 2845 2851 2889 2889 
1957 67 34624? 3473 3472 3468 2462 1436 2436 32417 34146 

STANDINGS SECOND YEAR AFTER PEAK 
DATE AT PEAK +413 MO #14 MO #15 MO +416 MO 417 MO +18 MO #19 MO +20 MO #21 MO +22 MO +23 MO +24 MO 
1923 5 1687.0 1688 1765 1795 1790 
1926 «68 191100 1927 1932 1940 1984 
ig29 66 371,7*° 745 732 716 702 693 691 728 733 664 663 682 626 
1937 5 75993 675 677 686 690 692 698 701 710 710 717 711 718 
1945 2 1735,7 1726 1742 1753 1773 1819 1626 1788 1840 1839 16586 1867 18469 
1948 11 212643 2082 2146 2175 2235 2194 2205 2222 2256 2294 2330 2352 2372 
1953 7 288220 2872 28°% 2895 2921 2935 2935 2942 2966 3009 3039 2051 30865 

1987 7 344202 

PER CENT CHANGES FIRST YEAR AFTER PEsK 

DATE AT PEAK + 1MO0 #20 +30 +40 ©5 M0 ©6M0 © 7M0 © 8M0 #©9M0 010 40 411 MO 412 MO 
1923 5 1687,0 ~206 1036 2225 1048 
1926 8 1911.20 -2030 -078 37 -016 
1929 86 87le7* ~08 ~*53 -3086 ~4e44s 5081 ~6084 -8ell -7230 ~8.80 “909% -12058 -13084 
1037 5 759.3 lel4 «35 61 -1e76 -3447 ~5e57 -@.08 ~%e53 -9¢53 ~[e79 -10084 -11063 
1945 2 173567 002 -079 033 288 +83 1065 -Se97 3290 ~2034 -3209 -2205 2046 
196 11 212863 034 1094 -2018 ~1le4? ~1e99 -2022 -27207 ~4e24 -2430 -2069 -6067 3068 
1953 7 268240 ~e21 238 228 259 066 -lell 1001 1021 1028 -1208 280 -080 
1957 7 346263 el 228 016 -201 ~076 -076 -le3l 1040 

PER CENT CHANGES SECOND YEAR AFTER PEAK 

DATE AT PEAK 413 ”0 414 “0 +415 MO #16 “O #17 MO +18 MO +19 MO +70 MO +2] MO #22 “0 #23 40 +26 MO 

s 68700 006 4062 6240 Sel} 

1926 8 191120 284 1210 1252 Beh? 
1929 6 BIle7?*® -24053 ~-16002 -17086 -19046 -20050 -20073 -16048 <-15091 =-21653 -2309% ~25020 -27095 
1937 5 75963 —Llell +1084 ~2e66 -Se13 -68 67 -8.08 -7068 -6.50 ~6250 -$057 6036 —~5ee4 
1965 2 173567 056 036 1200 2015 4260 5020 3002 6201 5.95 7205 7057 7268 
19486 11 2126.3 2018 283 2+19 $201 2209 3260 4040 5690 7.78 2448 10051 11045 


1953 7 2862.0 ~e35 42 «45 1235 1084 1084 2208 2091 4.41 5045 5086 7204 


1957 7 346242 
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RETAIL STORE SALES 


STANDINGS FIRST YEAR AFTER PEAK 


1957 


DATE 

1920 1 
1923 5 
1926 10 
1929 6 
1937 5 
1945 2 
1948 11 
1953 7 


1957 7 


DATE 
1920 1 
1923 (5 
1926 10 
1929 8 
1937 5 
1945 2 
1948 11 
1953 7 


1957 7 


5 
2 
11 
7 


7 


AT PEAK 


3120 


3843 
3583,3* 
6256,0 

11014,7 
14184,3 


1694940 


AT PEAK 
31.0 


34,3 


3583,3# 

6256.0 
11014,7 
1418493 


1694940 


AT PEAK 
31.0 
3463 
37,0 
3863 

3563,3* 

6256.9 

11014,.7 

1418493 


1694940 


AT PEAK 
31,0 
3463 
37.0 
3843 

35863,3* 

6256.0 

11014.7 

14184,3 


1694940 


* 1 M™0 
33 

35 

37 

39 
3553 
6346 
11146 
13939 


17030 


+13 MO 
32 
34 
37 


35 


7797 
10801 


14006 


+ 2 M0 
32 

34 

37 

38 
3601 
6109 
10694 
13634 


16919 


+14 MO 
31 

34 

37 

35 
3116 
6076 
11075 


14070 


-8270 
-13.04 
29409 


55 


+ 3 MO 
32 
35 
37 


38 


8334 
11309 


13946 


* @ M0 


3648 
6404 
10888 
13988 


16562 


+ 5 MO 
33 

35 

37 

37 
3593 
6572 
11060 
13606 


16855 


STANDINGS 


+16 MO 
30 

34 

37 

33 
3192 
8256 
11395 


14233 


+17 MO 
30 

33 

36 

34 
3180 
8679 
11432 


14995 


PER CENT CHANGES 


+3 M0 
3023 
1694 


200 


200 
-3,88 
200 
-11230 
11067 
33022 
2067 
-1268 


+ 4 M0 
9068 
~e97 

200 
~087 
1480 
2037 

-1015 
-1038 
-2028 


* 5 MO 


£e45 


GENT CHANGES 


+16 MO 
~3023 
~e97 
200 
-13691 
=10292 
31097 
3445 
234 


+17 MO 
-3023 
3288 
-2070 
-11430 
-11026 
38673 

3079 

5072 


+ 6 MO 
34 

34 

36 

37 
3346 
6467 
10965 
13512 


16718 


+ 7 ™O 
33 

34 

36 

37 
3254 
6497 
10940 
13836 


16115 


SECOND YEAR 
+218 4O +19 MO 


20 
35 
36 


34 


29 
34 
37 
36 
3344 
9040 
12069 


1463] 


+ 8 MO 
33 

35 

36 

36 
3262 
6835 
10742 
13792 


15928 


+9 

31 

35 

37 

36 
3174 
7065 
10855 


14146 


AFTER PEAK 


+20 MO 
28 

34 

37 

35 
3390 
9136 
12985 


14936 


#21 MO 
29 

35 

38 

33 
3361 
9341 
12965 


15131 


FIRST YEAR AFTER PEAK 


+6 M0 
9068 
097 
“2070 
~3048 
6062 
3037 
~045 
~4e74 


1036 


+ 7 MO 
6045 
-e97 

-2070 
-3048 
-9019 
3085 
068 
2046 


4692 


+ 8 M0 
6045 
1694 
-2070 
~6209 
-8297 

9025 
2048 
2077 


-6002 


+9 M0 
200 
1694 
200 
-6209 
~11e42 
12.93 
~1645 


-e27 


SECOND YEAR AFTER PEAK 


+18 MO 
-3023 
1694 
-2070 
-11030 
~8041 
46655 
5082 
3089 


+19 MO 
6045 
097 
200 
-112030 
6068 
44.50 
9057 
3015 


+20 MO 
9268 
“97 
200 
-6.70 
5039 
46204 
17289 
5230 


+21 MO 
6045 
1.94 
2270 
“13291 
5065 
49.31 
17.71 
6.67 


+10 MO 
33 
34 
38 
35 


+22 moO 
29 

35 

37 

33 
3419 
9447 
12381 
15232 


+10 MO 
6045 
~097 
+70 
8270 
-12296 
12092 
e47 


-1097 


200 
713691 
4059 
51001 
12040 
7039 


+23 Oo 
29 

35 

39 

32 
3416 
9367 
12043 
15207 


200 
97 
2970 
8670 
13943 
20024 
-1604 
76 


+23 MO 
6045 
1694 
5040 
16052 
~4667 
49.73 
9034 


7Te21 


200 
-8270 
“15019 
22695 
046 


-028 





DATE 
1920 
1923 


1926 1 
1929 
1937 
1945 
1948 1 
1953 


1957 


1 
5 


° 
8 
5 
2 
1 
7 


7 


MEASURING RECESSIONS 


AT PEAK 
152640 
1075,0 
710,7* 
65403 
61667 
70940 
1053,7 
1145.43 


1256.3 


AT PEAK 
1526.0 
1075,0 
710,7* 
654,43 
61667 
709.0 
105347 
114543 


125643 


AT PEAK 
3526.0 
1075.0 
710,7* 
654.3 
61607 
70940 
1053,7 
1145.3 
1256.3 


AT Peak 
1526.0 
1075.40 
710.7# 
65443 
616,7 
709.0 
105267 
114563 
125603 


+ 1 M0 
1592 
1051 
709 


655 


1054 
1149 


1260 


#13 MO 
1141 
961 
665 
595 
581 
731 
1002 


1144 


+10 
4e32 
2023 
023 
210 
-027 


+13 MO 
-25023 
8,74 
6043 
~9.07 
-5.78 
3210 
-4290 


~el2 


WHOLESALE PRICES EX FARM AND FOODS 


#2 M0 
1628 
1034 
701 
655 


+ 3 M0 
1690 
1020 
690 


649 


STANDINGS 
+4MO + 5 MO 
1706 1705 
1018 1006 
685 676 
647 640 
614 609 
712 713 
1033 1020 
1145 1146 
1259 1261 


STANDINGS 

+16 MO +)7 MO 
1055 1024 
977 977 
664 662 
574 565 
582 580 


755 783 


PER CENT CHANGES 


* 3 M0 
10075 
5012 
2091 
-«81 

-ell 
28 
-1.30 
206 


13 


#4 M0 + 5 MO 


11480 11673 
~5030 6042 
-346) ~4088 
1912 


-e43 


-2019 
1024 
042 056 
-3420 

206 


037 


CENT CHANGES 
+16 MO +17 MO 
-30086 +3290 
9012 -9e12 
~6057 6085 
-12028 +1365 
5062 5495 
6049 10244 
-4043 ~4033 


223 032 


FIRST YEAR AFTER 
+7 + 


6 MO 
1734 
993 
669 
636 
602 


714 


1008 
1146 


1262 


SECOND YEAR 
+18 MO +19 MO 


992 
968 


1742 1 
995 1 
668 

631 

598 

714 
1001 
1144 1 


1258 1 


+20 
967 
1006 1 
664 


PEAK 


8 MO 


674 


024 


142° 
259 


MO 


968 


o14 


662 


FIRST YEAR AFTER PEAK 


+ 9 MO 
1580 
1035 
$68 
624 
594 
717 
1000 
1145 


AFTER PEAK 


+21 MO 
998 
1040 
662 


+6M0 #7M0 +8 M0 + 9 MO 


13063 
-7063 
5086 
-2980 
2038 

270 
~4233 
006 


045 


SECOND YEAR 
+18 MO 419 MO 


-34299 
-8.09 
~6e71 

-14042 
6059 
12055 
-3067 


«58 


14615 9 
“70440 = 
-6200 
-3957 
-3203 

«70 


-3 

-5,00 <5 

-e12 * 
o13 


+20 
-36063 -36 
6042 -5 
6057 
-15064 
6492 -7 
13012 16 
3201 -1 


1202 


-17el7 


270 


074 


-6200 
=4402 


019 
85 
028 
229 
e21 


MO 
57 


«67 


6085 


208 
078 
049 


093 


3.54 
-3072 
~6400 
4564 
-3068 
1.13 
5,09 
203 


AFTER PEAK 


+21 MO 


34,60 
—3026 


~6285 


-17.93 


6292 
21672 
e51 
1.02 


+10 MO 
1440 
1028 
670 


+11 M0 
-12065 
-5067 
5458 
-7069 
—4e97 

1069 
-5.09 

-029 


+23 MO 
~34e40 
-5e77 
6043 
-19015 
6059 
29048 
4049 


093 


#12 MO 
-18061 
6098 
5086 
8061 
5030 

2012 
5009 

-#20 


+24 MO 
356071 
-5049 
6057 
-19200 
~6059 
30004 
5092 
1072 





OATE 
1920 
1923 
1926 
1929 
1937 
1945 
1948 
1953 


1957 


2 
5 


AT PEAK 
498.0 
516.0 
$30.0 
678.0 
427.0 
102,0° 
195.0 
182.0 


204.0 


AT PEAK 
498,90 
516.0 
530.0 
678.0 
427.0 
102.0 
19540 
182,0 
204.0 


aT? Prax 
“96,0 
516.0 
530.0 
578.0 
427,0 
102.0% 
195.0 
182.0 
204.0 


AT PEAK 
49844 
516.0 
530.0 
678.0 
427.0 
10200 
195.0 
182.0 
204,0 
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CORPORATE PROFITS AFTER TAXES 


STANDINGS 


STANDINGS 


PER CENT CHANGES 


+3 0 
10.44 
~27s13 
~6023 
“17e1) 
-15069 
-5,88 
13433 
~3285 


PER CENT CHANGES 


#15 MO 


~4605) 
~26023 
-92018 
-66051 

10078 
19049 


-13074 


328 
310 
452 
394 
204 

6° 


ise 


SECOND YEAR AFTER PEAK 
#18 MO 


FIRST YEAR AFTER PEAK 
+6 M0 79 


#21 MO 


FIRST YEAR AFTER PEAK 


* 6 "0 
34,146 
=39492 
=14e72 
41.89 
52022 
-32.35 
16092 
-26037 


SECOND 
+18 MO 


+ 9 MO 
89,56 
-15.89 
-25e66 
-57,08 
-72036 
36927 
-22,05 
-15,38 


YEAR AFTER PEAK 


+21 MO 





STANDINGS FIRST YEAR AFTER PEAK ' 

DATE AT PEAK +1MO +2M0 +3 M0 +4MO #5 M0 +6M0 +7M0 +¢6MO0 #9M0 410 MO #11 MO #12 MO 
1923 5 49840 496 498 493 480 484 488 462 479 476 471 474 466 
1926 10 48463 482 477 478 478 478 476 482 478 476 81 477 473 
1929 @ 488,47 468 492 468 466 459 461 456 455 449 446 435 426 
1937 5 400,7* 393 387 383 373 368 355 342 343 348 347 347 347 
1945 2 452.7 452 451 442 446 445 408 417 414 411 412 408 405 
1948 1) 396.7 296 395 394 391 388 389 368 392 391 395 39% 390 
1953 7 40620 405 399 401 398 397 395 396 394 393 395 395 397 
1937 7 399.47 400 398 393 391 389 388 384 385 

STANDINGS SECOND YEAR AFTER PEAK 

DATE AT PEAK +13 MO +14 MO 415 MO +16 MO 17 MO +18 MO 419 MO +420 MO #21 MO 422 MO 423 MO +24 MO 
1923 5 498.0 458 456 460 464 474 475 480 462 479 481 478 480 
1926 10 484,53 477 473 477 478 477 471 477 481 482 484 463 “76 
1929 8 488.7 424 425 415 414 400 o18 418 417 428 ais 406 401 
1937 5 400,7* 346 357 362 369 368 366 371 376 376 375 371 370 
1945 2 452,57 407 409 401 401 401 605 401 402 402 403 405 406 
1946 11 396,47 393 397 397 397 401 402 405 409 412 408 410 #10 
1953 7 406.0 397 397 397 400 400 463 404 406 406 #10 407 406 
1957 7 399,47 

PER CENT CHANGES FIRST YEAR AFTER PEAK 

DATE AT PEAK + 1M0 +2M0 +30 +4M0 +5 M0 +6M0 +7MO0 +8 MO +9 M0 410 MO #11 MO +12 MO 
1923 5 498.0 -240 200 -1200 -3261 -281 -2.01 -3621 -3-81 4,42 5042 ~4o82 6043 
1926 10 48463 -o48 “1651 -1e31 “1631 ~1e31 -le72 -048 -le3l -1e72 069 ~1e51 2034 
1929 8 488,7 -914 268 4423 —4 264 -6207 5066 -7209 -6089 -8212 6073 -10098 -12041 
1937 § 400.7 -1691 -3eel bel 6090 -Bel5 11040 -14064 -14039 -13514 -13039 -13039 =13039 
1945 2 45247 -e15 037 =2036 -1eS7? =1069 -9087 =7088 -6e54 -9620 -8098  -9087 10053 
1948 11 39667 -017 -042 ~067 -1eO3 =2018 -1093 =-2018 -1e18 =1443 042 -067 ~1e68 
1953 7 46620 025 <-1le72 -1e23  -1097 =-2022 ~2071 2046 -2096 =3020 =-2071 =-2071 =2022 
1957 7 399.7 008 042-1067 =2017 =2067 =-2092 +3092 -3067 

PER CENT CHANGES SECOND YEAR AFTER PEAK 

DATE AT PEAK 413 MO +14 MO +15 MO 416 MO +17 MO +18 MO +19 MO #20 MO +21 MO +22 MO +23 MO +24 MO 
1923 $ 498.0 8.03 -8e63 -7063 -6083 4082 4062 -3061 -3021 —3081 -3e42 4002 3061 
1926 10 484,3 -1le51 -2034 -1e51 <-2e32 -leSl -2075 -1051 269 948 07 -027 ==1e31 
1929 8 4884.7 13623 -13003 -15007 -15028 ~-1801% -14066 -14046 -14067 -12041 ~-15028 -16092 -17094 
1937 § 40007" -13064 -10090 -9065 -7090 -80l5 -68065 -7040 -60156 -6016 6041 ~-7040 -T7065 
1945 2 45207 10009 9665 -Lle4) =LleS1 -LleG1l =-10053 -2leG1l -11el9 <-11e019 -10097 ~-10053 ~-10075 
1948 11 39667 ~092 208 208 08 1209 1634 2010 3e1) 2.87 3e1l 2436 3036 
1953 7 4060 2022 =2022 <-2022 -1048 -1068 -074 049 200 200 +98 025 #00 
1957 7 39947 


MEASURING RECESSIONS 








AVERAGE WORK WEEK 


IN MANUFACTURING 


303 
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LAYOFF RATE MANUFACTURING 


STANDINGS FIRST YEAR AFTER PEAK 
aC +5 MO +¢6M™0 +¢7M™O0 + 8 MO #9 MO 


av PFak + 1 MO MC 4 6 
5 22 32 21 131 
42 64 31 46 31 

a2 72 

122 118 

308 237 

80 

110 


140 


STANDINGS SECOND YEAR AFTER PEAK 
AT PEAK +13 MO #15 MO +16 MO +17 MO +18 MO +19 MO #20 MO +21 MO +23 MO 


6.7 216 271 207 260 223 182 229 104 102 
20.7 83 59 ? 34 29 30 52 
50.3 7 60 53 51 
5263 97 3 87 95 118 


391 f 385 


230 260 


ABSOLUTE CHANGES FIRST YEAR AFTER PEAK 

*3M™0 +¢4M™0 +5 MO #6M60 ¢ 760 + 8MO + 9 MO 

~le? 2503 1463 12403 

21¢3 1003 1463 

3107 347 367 
6907 
177.7 12403 15603 
6667 : 3363 9303 
16343 : 8667 7607 
7607 10607 


10363 11303 


ABSOLUTE CHANGES 
AT PEAK +413 MO #15 MO +16 MO +17 MO 1 O +1 +23 MO 
6.7 20903 3 26463 25363 7 22 : 9563 
46.3 31.3 
1707 2 e7 
44.7 


7723 








DATE 
1920 1 
1923 5 
1926 10 
1229 8 
1937 5 
1945 2 
1948 11 
1953 7 
1957 7 

DATE 
1920 1 
1923 5 
1926 10 
1929 86 
1937 5 
1945 2 
1948 11 
1953 7 
1957 7 

DATE 
ag20 1 
1923 5 
1926 10 
1929 8 
937 5 
945 2 
$48 11 
953 7 
9s7 7 

DATE 
920 1 

923 5 
2926 10 
929 6 

937 5 

945 2 

948 11 

e337 

957 7 





MEASURING RECESSIONS 


AY PEAK 


1190.0 

110547 
42940 
42667 
388,0* 


$7667 


AT PEAK 


1190.0 
110567 


429.0 


AT PEAK 
1190.0 
110567 
429,0 
426.7 
388,0* 
57607 
40667 
400,0 


30363 


AT PEAK 
1190,0 
110567 
429,0 
42667 
388.0% 
57667 
40607 


400,0 


+ 1 MO 
1266 
1156 

393 


396 


520 
370 


360 


i) 
@ 
=) 


+13 MO 
260 
236 
310 


147 


449 


290 


+13 MO 
-91000 
86907 
-11940 
27907 
6940 
17303 


3303 


-11020 


GROSS ACCESS 


ION RATE MANUFG 


STANDINGS 


+ 4 MO + 5 MO 
975 1352 
787 57 

72 374 
200 216 
326 261 
540 560 
320 320 
300 300 
240 240 

STANDINGS 

+16 MO +17 MO 
288 160 
292 239 
291 311 
113 132 
3389 476 
610 720 
380 390 
360 360 


ABSOLUTE CHANGES 


+4M™0 + 5 MO 
-215.0 162.0 
“31807 53007 
-57.0 -55.0 
-22607 -21007 
6209 -127.0 
-3607 ~1607 
8607 -86.7 
-100.0 -100.0 
-6303 -0363 


ABSOLUTE CHANGES 


+2M0 + 3 MO 
1278 1010 
961 721 
371 331 
346 232 
311 275 
510 540 
330 330 
340 290 
280 260 
#14 MO +15 MO 
292 274 
216 260 
324 305 
133 100 
445 434 
730 660 
370 360 
290 320 
+2 M0 + 3 MO 
68.0 ~180.0 
-14467 -384e7 
-58,0 -98.0 
~B0e7 -194e7 
“77.0 -11360 
~6667 3607 
~T6e7 -7607 
760.0 -110,0 
-2303 ~4303 
+14 MO +15 MO 
~89820 =-916.0 
“88907 84507 
-105.0 =-124e0 
29307 32607 
57.0 46.0 
15363 8343 
~3607 ~4607 
-110.0 -80.0 


+16 MO +17 MO 
=90200 -103000 
81307 -86667 
-138.0 ~-118.0 
-31307 -294e7 
120 88.0 
3303 14363 
-2607 1607 
-40.0 -4020 


FIRST YEAR AFTER PEAK 


*+6MO0 + 7 M0 
1253 1237 
469 478 
379 378 
240 199 
195 262 
560 670 
350 370 
290 260 
260 240 


SECOND YEAR 
+18 MO +19 MO 


101 149 
426 472 
319 371 
155 199 
461 398 
670 640 
440 400 
350 360 


FIRST YEAR AFTER PEAK 


+6MO + 7 MO 
6340 4720 
~63607 =-627e7 
—5020 -5120 
18607 <-2270e7 
-193.0 =12620 
-1607 9303 
~56e7 ~3607 
=1104.0 -120.0 
~4343 6303 


SECOND YEAR 
+18 MO #19 MO 


-1089e0 -104140 


-67907 -63307 
-110.0 -58.0 
~27le7 =227e7 
73.0 10,0 
9303 6303 
3303 ~6e7 
-50.0 -4020 


+8 MO +9 M0 
1045 592 
556 478 
352 306 
194 194 
367 352 
780 880 
340 360 
290 260 
AFTER PEAK 
+20 MO +21 MO 
200 435 
484 442 
323 379 
167 155 
397 344 
620 580 
460 540 
380 370 
+ 8 MO + 9 MO 
-145e0 -59800 
“54907 62707 
-77e0 =12360 
—23207 <-23207 
-2140 -3600 
20303 30303 
6607 ~4607 
-11000 ~-14000 
AFTER PEAK 
+20 MO +21 MO 
99000 -75540 
-621e7 66307 
-10640 -5020 
25907 -27le? 
920 -44,.0 
4303 303 
5303 13363 
-2020 -3000 


+10 MO 
426 
361 
282 
154 
326 


630 


270 


+22 MO 
454 
447 
397 
126 
348 
520 
500 


380 


+10 MO 
-76420 
=74467 
-147.0 
-272e7 
62.0 
25303 
4607 


-130.0 


#22 MO 
-73600 
-658e7 
$2.0 
-30007 
-40.0 
5607 

9303 


-2020 


305 


+11 MO 
342 


+23 MO 


450 
377 
151 
318 


560 


+23 MO 
-70720 
65507 
-5200 
~275e7 
-70.0 
-1607 

733 
5000 


+12 m0 
299 
233 
307 


105 


760 
370 


299 


+24 MO 
560 
358 
429 
128 
354 


560 


+12 MO 
-89100 
-8720e7 
-12200 
“32107 
-8340 
18303 
-3607 


-11020 


+24 MO 
-63020 
-T47Te? 
20 
29807 
-3420 
~1607 
3303 


6000 





151,0* 
12220 

6158.0* 
74865,3 
1112563 


13136,0 


AT PEAK 
564,? 
857.3 
974,7 

151,08 
12220 

8158.0" 
748563 
12125,3 


13138,0 


151,0* 
122.0 

8158,0* 
748563 
1112563 


131278.0 


AT PEAK 
56462 
85763 
9747 
151.0* 
12240 

8158,0* 

7485.3 

1112543 


13138,0 
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ORDERS FOR DURAALE 


104 
4776 
6911 
9622 


12154 


7176 
7514 


11103 


* 4 


STANDINGS 


MO 


499 


2767 
6735 
9416 


12362 


+35 “0 


470 
R86 


$80 


92 
4547 
6223 
6928 


11399 


STANDINGS 


+16 MO 


82 
5616 
8081 


10856 


+17 MO 


96 
6188 
6198 


11786 


PER CENT CHANGES 


+ 30 


le 
-3,97 
12029 
-13.70 


~e26 


+13.M0 
49650 
-1e20 
048 
-38041 
~40016 
~32046 
294% 


~7076 


* 4 MO 


-15689 
~19267 
96208 
10202 
-15036 


-5091 


+ 5 ™O 


-16072 


CENT CHANGES 


+16 “0 
35085 
24067 


3062 


=31066 
-32079 
lel? 

7496 


2042 


+17 MO 
=32021 
27484 
=1¢50 
=59260 
=21¢31 
~24e15 

9252 


5292 


GOODS 


FIRST YEAR AFTER PEAK 


«6 “0 
516 
e523 


240 


84 


6145 


9097 


10706 


*7?s™o 


439 


1936 
6261 
9842 


10549 


SECOND YEAR 
718 MO +19 MO 


459 


+ 8 “oO 


AFTER PEa 
+20 MO 


501 
1077 
990 
67 


96 


«9 m0 
327 
oR? 


arc 


x 
+21 MO 


414 


5916 
14260 
12489 


FIRST YEAR AFTER PEAK 


+6 ¥O 
8256 

-250 
-3056 
-12058 
31015 
-99499 
“17691 
-16¢23 


718053 


7 MO 


+ 


29280 
-37e70 
-76027 
16062 
~11653 


=19.72 


SECOND YEAR 


#18 “O +19 MO 


37027 
43482 
55 
=54630 
718685 
+2613 
26203 


11049 


716066 
33090 
5068 
~51066 
719067 
=29092 
34.70 
12048 


+ 8 MO 
-34,08 
26020 
=4658 
26049 
39934 
50048 
-22058 


=14530 


AFTER PEA 
+20 MO 
~lle22 

25062 
1457 

55063 

~21e31 

=24e52 
52.79 


21069 


+o “Oo 
-42,06 
1be42 
-10.76 
-29,80 
-36589 
-40.70 
-10691 


-15209 


+10 MO 
~S4e71 
134469 
5061 
~29e14 
-35025 
“37015 
~6059 


14052 


+22 “0 
2095 
2088 
048 

-58028 

-19%067 

-22059 

60.81 
21490 


~61059 
=27005 
~26048 
58034 
20085 





OATE 
1920 


1923 


i 


5 


1926 10 


1929 
1937 


1945 


8 
5 
2 


1946 11 


1953 
1957 


7 
7 


MEASURING RECESSIONS 


AT PEAK 


16056% 


3121,.0* 
4316.0" 
3186.3 
206557 
504.0 
353163 
558993 


10217.0# 


aT PEAK 


1805.3 


3121,0* 
4316,0* 
3186.3 
2065.7 
504.40 
353163 
$5893 


10217,0* 


aT PEAK 


1805.3 


3121.0* 

4316,0* 
3186.3 
2065.7 
504.0 
3531.3 
$569.3 

10217,.0# 


aT? Prak 


1805,_3 


3121.0" 
4316.0* 
3186.3 
206547 
504.0 
3531.3 
5589.3 


10217,0# 


+ 2 0 


155° 


2709 
4187 
258) 
1890 

512 
3750 
$324 


10551 


+13 MO 


=36069 


4e81 
1.53 
-37.58 
4463 
838.69 
66479 


26029 


RESIDENTIAL CONTRACTS 


+ 2 ™0 


1566 


3120 
4019 
2528 


1688 


10667 


+2 ™0 


13,37 


-203 
~6088 
20066 
-8.60 
74.80 
~25.89 
-e13 


4&e40 


+14 "0 


-35025 


-5209 
14039 
~38.49 
142 
965.67 
56680 


50034 


+ 3 oO 


1711 


2954 
4311 
2182 
1892 

833 
3332 
6338 


10534 


#15 “0 


n2 


140 


PER 
#15 MO 


-22029 


-3052 
24014 
-39268 
1086 
1109072 
63268 


47062 


STANDINGS 
+ 4M0O + 5 MO 


1336 B46 


2750 3817 


4048 4325 
2194 1895 
1736 1443 
757 855 
2915 3472 
5710 5754 


9930 9283 


STANDINGS 
+16 MO +17 MO 


1$89 1749 


3070 3120 


5524 4899 
18615 1677 
1998 2471 
5052 


7375 


4626 
7867 
6157 9543 
CENT CHANGES 

+ 4 MO 


+ 5 mo 


-26200 +5325 


11089 22230 


6021 e21 
“31014 -40253 
-15086 ~-20014 
50220 


717045 


69264 
-1.68 
2016 2.95 


-281 -9e14 


CENT CHANGES 
+16 MO 17 MO 


11098 =3012 


~1063 -«03 


27099 13.51 
~43.04 -47637 
-3028 19262 
902e38 817286 
108.84 122678 


45494 70074 


FIRST YEAR 
*6"Oo *#¢ 70 


641 966 


769 944 


3647 4633 
6174 6129 


9564 7840 


SECOND YEAR 
+18 MO #19 MO 


1623 1930 


3356 3489 


4596 5054 
20706 1904 
2187 2553 
4560 5178 
7198 7785 


8954 8667 


AFTER PEAK 


*68m”o +9 M0 


798 980 


4173 
3773 
2022 
1522 
1607 
4136 
6538 


AFTER PEAK 
+20 MO +271 MO 


2337 2132 


3305 3713 


$021 4653 
1825 1889 
2938 2621 
3418 3387 
7955 7545 


8902 6768 


FIRST YEAR AFTER PEAK 


*6M0 «+ 7 MO 


52031 -86260 


12450 11089 


“7025 =-20230 
“40634 -43,98 
21009 -33444 
52058 87630 
3028 31620 
10046 9065 


6039 =-23026 


SECOND YEAR 
#18 MO +19 MO 


-10010 6091 


7053 11e79 
6049 
~35003 


5087 


17210 
~40024 
23459 
804076 927638 
103083 120045 


602020 $5206 


+8 MO «9 M0 


-55.80 -45,72 


49086 33,71 


“5047 -12.58 
-35073 =36,54 
-33639 -26032 
115087 228,85 
16016 17212 


12423 16097 


AFTER PEAK 
+20 MO +2) MO 


29045 16.09 


5290 16,97 
7.81 


-40.72 


16033 
~42472 
42023 36.57 
57801? $72602 
125027 113466 


59027 56,87 


+10 “QO 


925 


+10 MO 


48476 


28.93 
71658 
~39205 
-7049 
253677 
66062 
21659 


#22 “Oo 
21092 


16062 
6049 
50041 
41065 
519064 
72020 
39019 


#11 MO 


57946 


19442 
-7207 
~40040 
-12081 
360652 
$5010 
28430 


+23 “0 
27268 


28080 
4068 
~48206 
32e11 
925440 
65066 
67948 


+12 “0 
54202 


7695 
3494 
~454052 
-14099 
374460 
57e1l 
25201 


+24 MO 


23063 


30060 
0% 
-54e30 
34087 
705636 
79637 
$8200 





DATE 
1920 


1923 


1 


5 


1926 10 


1929 
1937 


1945 


5 


2 


1948 11 


1953 


1957 


> 


7 


AT PEAK 
3324.0 
1423,7 
1914,0* 
2256.0* 
115267 
129303 
1488.7 
204063 


3661,.0* 


AT PEAK 
3324.0 
142367 
1914,0* 
2256,0* 
115267 
129363 
1488,7 
204063 


3661,0% 


AT PEAK 
3324.0 
1423,7 
1914,0* 
2256,0* 
115207 
129343 
1488,7 
2040.3 


3661.0* 


AT PEAK 
332440 
1423.7 
1914,0* 
2256,0* 
115207 
1293.43 
1488,7 
2040.3 


9661.0" 


+13 MO 
-78e52 
~19.64 
-17e76 
-51.64 
-50081 
205.80 
-12.74 


-e75 
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COMMERCIAL AND INDUSTRIAL CONTRACTS 


+14 MO 
~81044 
-38.33 
-15205 
61444 
54054 
16126 
-13.48 


246 


-64_76 
-47025 
202440 
-16.57 


7283 


STANDINGS 
+4 M0 + 5 MO 

1726 1890 
1107 11246 
1786 1931 


2345 1360 


1012 723 
960 1446 
1412 1376 
2200 1301 
3377 2593 


STANDINGS 
+16 MO +17 MO 


852 


CENT CHANGES 
+4M™0 + 5 MO 
48.07 -43014 
-22024 -214205 
-6069 289 
309% -39072 
-12020 -37e28 
-25677 11280 
-5015 -7e57 
7e83 -36024 
-7e76 =-29017 
CENT CHANGES 

+16 MO +17 MO 
=74e37 <=-730e77 
-24e70 -14e16 
65203 -65678 
59640 -41070 
194666 135498 
15.00 30.45 


9092 1201 


FIRST YEAR AFTER PEAK 


+6MO0 +7 M0 


1434 1211 


78 1350 


1926 1884 


1730 1643 


703 735 
1415 2806 
11868 1209 
1924 1616 
3259 2861 


SECOND YEAR 
+18 MO +19 MO 


909 937 


1338 1273 


+8 MO + 9 MO 


1442 975 
1139 1495 


1830 1715 


1517 1454 


532 539 
2653 3512 
1165 902 


1800 


AFTER PEAK 
#20 MO +21 MO 
1004 1161 


1582 997 


FIRST YEAR AFTER PEAK 


+6M™O0 + 7 M0 


-56086 -63057 
-10023 5017 
263 -1057 
23032 <-27e17 
39001 -36023 
9e41l 116096 
20020 -14e76 
5270 -20280 
“10098 -21685 
SECOND YEAR 
+18 MO +19 MO 
-72065 <-71e81 
6002 ~10058 
-70021 =-72061 
46021 -48055 
62440 79423 
35076 34042 


14049 28016 


+8 MO + 9 MO 


56062 -70067 
-20200 5,01 
4039 -10040 
32076 -35055 
53085 -53224 
105013 171655 
—21le74 -239e41 


-11.78 -5016 


AFTER PEAK 

+20 MO +21 MO 
69080 -65.07 
llel2 =-29.97 
-70,08 -71,90 
34067 -34.07 
10320 47260 
32053 66226 


26055 6099 


+10 MO 
1020 
1213 
1688 
1226 
633 
2892 
1361 


2201 


+10 MO 
-69e31 
—14-80 
-11¢081 
45066 
45208 
123061 
-8058 


7e87 


-71¢01 
-33081 
12019 
49280 


13651 


+11 MO 
-76093 
-24-70 
-17087 
-46028 
-49068 
187016 
-18045 


4079 


-72069 
—34093 
83209 
52022 


21694 





MEASURING RECESSIONS 


AT PEAK 
1312.7 
105343 
3567eF 
391120 
2301,7* 
1397,_7* 
7093.0* 
859507 


1178620 


AT PEAK 
131207 
195343 
3567.3" 
3911.0 
2301, 7* 
1397.7 
7093,0* 
8595—67 


11786,0 


AT PEAK 
131207 
105363 
3567.3" 
391120 
23016” 
1397,7* 
7093,0* 
6595—7 


11786.0 


AT PEAK 


1397.7* 
7093— * 11288 
8595607 14.32 


11786,0 


NUMBER OF NEW INCORPORATIONS 


+ 2 M0 
1268 
1037 
3442 
3248 
2161 
1488 
6935 
6352 


11827 


12017 


20099 


+ 3 M0 
1250 
1036 
3545 
3747 
2110 
1614 
6697 
8702 


11843 


+15 MO 


228026 
14080 


20065 


STANDINGS 


FIRST YEAR AFTER PEAK 


+ 4 MO 


1164 


2078 
1659 
6758 
8452 


10779 


+ 5 MO 
1192 
915 
3770 
3529 
2002 
1717 
6612 
9005 


10682 


STANDINGS 


+16 MO 
932 


971 


+17 MO 
935 


990 


CENT CHANGES 


+ & MO 
-11¢33 
-18664 

-1e47 


-9092 


18270 
4072 
-1067 


~Be54 


+ 5 MO 
-9019 
-13013 
568 
9077 
-13002 
22085 
6.78 
4e76 


-9e37 


CENT CHANGES 


+16 MO 


-29,00 


14053 


31069 


+17 Oo 
-28.77 
-6001 
6021 
-14614 
-17258 
296 0b7 
7034 


40079 


+ 6 MO 
1138 
1034 
3707 
3628 
2070 
1975 
6958 
6371 


11474 


+ 7 MO 
1101 
1004 
3548 
3521 
2043 
2409 
6981 
8982 


11017 


SECOND YEAR 
+18 MO +19 MO 


FIRST YEAR AFTER PEAK 


+ 6 MO 


—13031 


3091 


—7e24 


-10006 
41031 
-1090 
-2061 


-2065 


1001 
1038 
4147 
3591 
1969 
3819 
8520 


11967 


+ 7 MO 
-16012 
-4268 
~254 
9097 
-11e24 
72036 
1056 
4249 


6052 


SECOND YEAR 


+18 MO +19 MO 


-30037 
-160923 
5026 
~12063 
-13e11 
191091 
21043 


34051 


~23474 
-1046 
16025 
-8018 
-14945 
173024 
20012 


39022 


+ 6 MO 
1109 


964 


+ 9 MO 
1105 


965 


AFTER PEAK 


+20 MQ 
970 


+ 8 MO 
-15052 
-8048 
3019 
9241 
-15076 
113021 
—5065 


8e24 


+21 MO 


904 


+ 9 MO 
15082 
-8.39 
-eT7l 
4,99 
16202 
129210 
4664 


8464 


AFTER PEAK 


+20 MO 


=26010 


14067 
162¢87 
5061 


38013 


+21 MO 


-31.13 


-11.89 
-14915 
141619 
10235 


24433 


+10 MO 
-19093 
-12047 

3402 
10494 
-14015 
143062 

8.78 


290 


-9e15 
140¢33 
-056 


30079 


#11 MO 
913 


914 


+11 m0 
—30045 
-13023 
5206 
8062 
-16089 
211002 
2206 


9204 


+12 m0 
-31097 
16065 
202 
-17¢e57 
-13093 
229048 
10074 


14002 


14096 

“12091 
-2e33 
137025 
2055 
32001 





oaT 
1920 
1923 
1926 
1929 
1937 
1945 
1948 
1953 
1957 


OaTE 
1920 
1923 
1926 
1929 
1937 
1945 
1948 
1953 
1987 


2 
1 


5 
10 


aT PEAK 
78,0 
433.0 
359.0 
40520 
12940* 
361343" 
2669.0 
3516,7 
4867,0 


AT? PEAK 
78.0 
433,0 
359.0 
405.0 
129,0* 
3613,3* 
266920 
3516.7 
4867.0 


AT Peak 

78,0 
433.0 
359.0 
405,0 
12920” 
3613.3" 
2669.0 
3516.7 
4867.0 


aT? Peak 
78,0 
433.0 
359.0 
40520 
129,0* 
3613,3* 
266940 
3516.7 
4867,0 


* 1-0 
96 
363 


+ 10 
23.08 
~16017 
“9.75 
7290 
11063 
~2031 
16556 
8.83 


245 


+13 MO 
665,38 
-023 
7056 
48239 
38.76 
11625 
29430 
4.10 


AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1958 


LIABILITIES OF BUSINESS FAILURES 


+2 0 
102 


+14 MO 
585.90 
~1e15 
13.09 
61073 
27613 
1.57 
5239 
16025 


*3 M0 
12? 
361 
424 
536 


153 


PER 
+15 MO 
375064 

34.18 

9047 

40674 
29046 
3023 
715029 
710037 


STANDINGS 

+4 M0 + 5 MO 
116 418 
363 891 
460 475 
544 4le 
115 152 
3370 3620 
3099 3100 
4000 4207 


$750 4358 


STANDINGS 

+16 MO +17 MO 
614 438 
434 406 
442 435 


675 644 


142 


CENT CHANGES 
+4M0 + 5 MO 
48.72 435.290 
“16017 105.77 
26013 32031 
34632 


-10085 


2072 
17283 
~6073 «18 
16013 16615 
13.74 19063 


18414 10.66 


CENT CHANGES 
+16 MO 
667.18 461.54 


#17 ™O 


23 
23012 
66067 


~6024 
21.17 
59201 
30223 10.08 
=12e27 +5690 
712089 ~22071 


8.40 9.65 


FIRST YEAR AFTER PEAK 
+ 


+6 M0 
255 


7 “Oo 
299 
498 410 
406 
462 551 
178 257 
1220 2100 
2508 3028 
2873 4083 


6256 558) 


SECOND YEAR 
+18 MO +19 MO 


498 452 
308 360 


365 388 


*8™MO0 + 9 MO 


374 437 
424 352 
337 


522 


AFTER PEAK 
+20 MO +21 MO 
468 597 
450 393 
377 344 
541 545 


189 


FIRST YEAR AFTER PEAK 


*+6"0 +7 M0 
226092 


25001 


283,33 
~5e31 
42062 13.09 
14207 
37298 


36005 
99022 


~66024 ~41688 


-6203 13645 
18030 16010 


28.54 14.67 


SECOND YEAR 
+18 MO +19 MO 
538046 479549 
726087 ~16086 
1067 8.08 
32059 44069 
3088 162001 
9059 71003 
~13034 +2720 


4e56 2023 


*6@mMo +9 M0 
379049 460.26 
~2008 16671 
21673 39.55 
286269 


40631 


39675 
58.14 
“13093 +6375 
-14601 17298 
45042 18.52 


200 200 


AFTER PEAK 
+20 MO +21 mO 
$00.00 665.38 
3093 = = 9024 
5001 6018 
33058 34.57 
46051 5043 
7Wel2 171022 
22093 39016 


4.62 026 


+10 MO 


+10 MO 
291203 
76098 
15406 
69078 
181040 
~59087 
17201 
4e25 
200 


+22 MO 
579049 
~37064 
70075 
38077 
26036 
315013 
=38055 


5099 


+11 ™0 
467 


#11 "0 
498472 
8454 
15060 
19675 
48.54 
19083 
-20008 
18032 
200 


+12 MO 
$31 


+24 0 
682905 
-8208 
9047 
48064 
16028 
-65079 
723019 
~4060 





MEASURING RECESSIONS 


DATE 
1920 
1923 
1926 
1929 
1937 
1945 
1946 
1953 


1957 


OATE 
1920 1 
1923 5 
1926 10 
1929 3 
1937 »> 
1945 2 
1948 11 
1953 7 


‘9s? 7 


OATE 
i920 1 
1923 «65 
1926 10 
ig29 66 
1937 5 
1945 2 
1968 1) 
1953 7 
1957 7 


AT PEAK 
2057247 
13520,.0 
12754,_7 
1263040 

1471¢7* 

183365 

1037,0* 

87663 


898.0 


AT PEAK 
2057247 
13520,0 
12754,7 
12630,.0 

1471,7* 

183563 

1037.0* 

878.3 


898,0 


AT PEAK 
2057207 
13520.0 
12754,7 
1263040 

1671,7* 

283363 

1027,0* 

678.3 
898.0 


AT PEAK 
2057207 
1352020 
127547 
12630.0 

1471,7* 

183363 

1037,0* 

878.3 


898,0 


+ 1 0 
20832 
13237 
12760 
12684 

1416 

183¢ 

10146 
886 


894 


#13 MO 
12117 
12260 
134646 
10362 


984 


BASIC PRICES 


* 3 "0 
20723 
12667 
12668 
12319 

1415 
1838 
901 
861 
846 


+15 M0 
11096 
12716 
13550 
9946 
1038 
1918 
821 
902 


STANDINGS 


+4 m0 
20304 
13006 
12536 
11958 
1382 
1837 
856 
877 


345 


+5 Oo 
19614 
13118 
12542 
11597 

1256 
1836 
801 
679 


846 


STANDINGS 


+16 MO 
10719 
12904 
13434 
9672 
1036 
1972 
612 
911 


+17 MO 
10672 
13174 
13378 
9408 
1052 
2402 
826 
69) 


PER CENT CHANGES 


* 3 40 
73 
-4283 
-268 
-2046 
-3085 
025 
w13011 
-1097 


5.79 


+40 
“1e31 
73280 
~le7) 
~5032 
-6.09 
+20 
17045 
~e1S 
-5.90 


+s ™O 
#266 
72097 
1067 
“8.18 
714.79 

215 
-22076 

208 


~5e79 


PER CENT CHANGES 


+15 MO 
46005 
~5095 
6024 
-21625 
29047 
4262 
-20083 
2069 


+16 MO 
-47290 
~4.56 
5.33 
-23042 
~29060 
7256 
-21070 
3672 


+17 MO 
~48012 
2056 

4.89 
-25651 
~28658 
31202 
~20035 


144 


FIRST YEAR AFTER PEAK 
+ 


6 MO 
19090 
13287 
12486 
11369 
1139 
1839 


802 


646 


+ 70 
18401 
13354 
12433 
11206 

1105 
18446 
774 
879 


657 


SECOND YEAR 
+18 MO +19 MO 


10893 
13440 
13427 
9230 
1049 
2414 
675 
910 


FIRST YEAR AFTER PEAK 


+ 6 MO 
=7623 
+1072 
-2013 
-9298 
-22060 

32 
~22066 

+53 


~5e79 


11072 
13732 
13316 
9199 
1036 
2405 
890 


915 


*7M™O 
10056 
71le23 
~2052 
711029 
-24691 
258 
-25036 
208 


4,57 


SECOND YEAR 


#18 MO 419 MO 


47005 
~059 
5027 

-26085 

28072 

31067 

15062 

360 


~46018 
1.57 
4038 
-27e17 
-29060 
31018 
14018 
4e17 


@ MO 
17442 
13236 
12403 
11061 

1120 

1857 

785 
896 


as? 


*¥ MO 
16292 
13046 
12482 
10856¢ 

1093 
1868 
640 


929 


AFTER PEAK 


+20 MO 
11138 
13910 
13166 
907r 
1039 
2523 
989 
689 


*emo 
15022 
2011 
2076 
712042 
723490 
1629 
-24.30 
2001 


~4957 


+21 MO 
11270 
13860 
13166 
e778 
1035 
2904 
1058 
902 


+o m0 
720281 
-3,51 
~2014 
-14,05 
~25.73 

1.89 
-19200 

5077 


AFTER PEAK 


+20 MO 
~45086 
2088 
3022 
-28.18 
-29040 
37062 
~4063 
1e21 


+21 MO 
~45022 
2.51 
3922 
=30.50 
29067 
$8.40 
2002 
2069 


#10 MO 
14650 
12776 
12742 
10668 
10746 
187) 

64s 


933 


+10 MO 
-28679 
~5250 
-010 
715053 
=27202 
2205 
~18023 
6200 


#12 “0 
-39016 
“8210 
4051 
-17036 
32039 
2655 
722008 
4e1? 


+24 MO 
744061 
7038 
2047 
731058 
~29074 
72015 
17¢55 
3072 





DATE 
1920 1 
1923 5 
1926 10 
1929 
1937 
1945 
1948 
1953 


1$57 


DATE 
1920 
1923 
1926 


1929 


aT PEAK 
102733 
962020 
15595.0 
3543167 
1732843 
15566,7 
180326.0 
2698063 
5026267 


AT PEAK 
10273,.3 
962020 
15595.0 
3542167 
1732843 
15586667 
18036,0 
2698043 


5026267 


AT PEAK 
1027343 
9620.0 
1559540 
3543167 
1732863 
155867 
1803620 
2698043 


5026247 


AT PEAK 
1027363 
962060 
1559520 
3543147 
1732843 
15586,7 
1803640 
2698003 


5026207 


+ 1 MO 
9650 
9255 

15395 
36235 
17030 
15690 
17557 
27221 


48797 


#13 MO 
7570 
9330 

186995 
22500 
12280 
19451 
19662 


34606 


+ 10 
6.07 
-3,.79 
“1023 

2027 
1le72 


066 


#13 MO 
26631 
3.01 
21280 
36050 
~29¢13 
24079 

9202 


28626 
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INDEX OF INDUSTRIAL STOCK PRICES 


+2 0 
9795 
8930 

15930 
29150 
17785 
16022 
17828 
26190 


47179 


+14 MO 
7505 
9925 
192800 
19875 
13975 
20375 
19979 


35271 


+14 MO 
-26295 
3017 
26096 
~43291 
-19¢35 
30072 
10.77 
30073 


*3 M0 
9945 
9045 
15465 

22820 
18295 
16608 
17574 
27073 


44338 


+15 MO 
7700 
10355 
19895 
18095 
14095 
20658 
20346 


35829 


PER 
+15 MO 
725005 

7064 
27057 
-46093 
18066 
32054 
12281 


32080 


STANDINGS 


+4 M0 
9110 
9075 
15815 
24720 
16025 
16674 
17606 
27710 


43673 


+ 5 MO 
9170 
8815 

16010 
25565 
12990 
16400 
17520 
26115 


43694 


STANDINGS 


#16 MO 
7670 
10290 
19535 
17215 
13650 
20600 
20630 
37575 


+17 MO 
6920 
10165 
20450 
16725 
14865 
20139 
21267 


39384 


CENT CHANGES 


+ 4 M0 
-11232 
-5e67 

1e41 
-30023 
-7052 

6298 
2038 

2270 
-13e11 


+ 5 MO 
-10074 
-8e37 

2266 
-27685 
-19426 

5022 
-2086 

4e21 
-13.07 


CENT CHANGES 


#16 MO 
25034 
6096 
25426 
51041 
21023 
32016 
14.38 


39027 


+17 MO 
-32264 
5067 
31013 
-52080 
714022 
29021 
17291 


45097 


+6” 

9070 

9065 
16405 
26740 
12475 
16792 
17250 
28664 
44568 


FIRST YEAR 
ofe+T7M 


ie) 

8525 

9410 
16880 
27825 
12435 
17780 
16488 
29213 


44416 


SECOND YEAR 
+18 MO +19 MO 


6860 
10765 
21245 
18155 
15225 
19686 
21936 
39843 


FIRST YEAR AFTER PEAK 


+ 6 M0 
~lle7l 
-5e77 
5019 
-24053 
-28201 
7073 
~4036 
6424 


-11033 


6695 
11545 
21630 
18005 
15110 
17318 
22102 
41025 


* 7 M0 
-17002 
-2018 

8624 
-21047 
-28024 
14,07 
8058 

8627 
-11063 


SECOND YEAR 


+18 MO #19 MO 


33022 
11290 
36023 

~48 676 

-12014 
26430 
21062 


47067 


34-83 
20001 
38070 

~49018 

-12¢80 
llell 
22054 
52005 


AFTER PEAK 
+ 8 MO 


8650 

9780 
16885 
28550 
12725 
18506 
17227 
29915 
45014 


+ 9 M0 
A485 
9880 

17535 
26670 
12545 
18934 
17943 


31092 


AFTER PEAK 
+20 MO +21 MO 


6935 
12155 
21150 
15800 
14565 
16953 
20530 
40891 


+ 8 MO 
15280 
1066 
8e27 
719042 
26056 
18.73 
4049 
10.88 
10044 


7170 
12045 
21085 
14145 
14485 
16898 
21660 
42299 


+9 ™0 
-17.41 
2470 
12.44 
-24,73 
-27260 
21.48 
052 


15424 


AFTER PEAK 


+20 MO 
-32050 
26035 
35062 
55041 
15695 
8e77 
13.83 
51656 


+21 MO 
-30021 
25621 
35420 
-60.08 
~16041 
8.41 
20.09 
56.78 


#10 MO 
7930 
9560 

18365 
24315 
11475 
19244 
16066 
32286 


+22 MO 
-26031 
25010 
45072 
60068 
18002 
10093 
23076 
56024 


+11 MO 
-29072 
—4e42 
25023 
35014 
35094 
27011 
3022 
21654 


+23 MO 
-22032 
24543 
53045 
-58.98 
26065 
12404 
27015 
63037 


#12 MO 
-27024 
-6003 
21071 
35042 
34047 
26007 
5079 
26049 


+24 MO 
~21654 
30051 
58067 
-60054 
-23025 
16203 
27018 
71029 





MEASURING RECESSIONS 


SOURCE NOTES AND UNITS 














Series and Period Covered Unit Source 


. Nonagricultural Employment 1929-39 | 10,000 persons BLS and FRB 
1945-58 1,000 persons BLS 
. Unemployment Rate 1929-39 | Per 10,000 in labor force NICB and NBER 
1947-58 | Per 10,000 in labor force Census 
tross National Product 1923-29 | $10 million, annual rate Barger and Klein 
1945-58 | Million dollars, annual rate | OBE 
. Industrial Production 1920-58 | 1947-49 = 100 FRB 
. Freight Cars Loaded 1920-58 | 1,000 cars per week AAR 
. Bank Debits outside N.¥.C. 1920-39 | $100 million FRB, 140 centers outside N.Y.C. 
1945-58 | $100 million FRB, 344 centers outside N.Y.C. 
. Personal Income 1923-28 | $10 million Barger and Klein 
1929-58 | $100 million, annual rate OBE 
. Retail Store Sales 1920-31 | 1947-49 = 100 FRB (department store sales only) 
1937-50 | Billion dollars OBE 
1953-58 | Billion dollars Census 
. Wholesale Prices, ex Farm 1920-25 | 1926 =1,000 BLS 
Products and Food 1926-58 | 1947-49 =1,000 BLS 
. Corporate Profits after Taxes 1920-39 | $10 million, annual rate Barger 
1945-58 | $100 million, annual rate OBE 
. Average Workweek in Man- 1923-31 | 1/10 hour per week NICB, 25 mfg. industries 
ufacturing 1937-58 | 1/10 hour per week BLS, all mfg. industries 
. Layoff Rate, Manufacturing 1920-31 | Per 10,000 employed Metrop. Life Ins. Co. and BLS 
1937-58 | Per 10,000 employed BLS 
. Gross Accession Rate, Man- 1920-31 | Per 10,000 employed Metrop. Life Ins. Co. and BLS 
ufacturing 1937-58 | Per 10,000 employed BLS 
. New Orders for Durable 1920-28 | 1923-25 =1,000 NBER 
Goods 1929-39 | 1935-39 = 100 NICB 
1$45-58 | Million dollars OBE 
. Residential Construction 1920-22 | 10,000 square feet Dodge, 27 eastern states 
Contracts 1923-25 | 10,000 square feet Dodge, 36 eastern states 
1926-55 | 10,000 square feet Dodge, 37 eastern states 
1957-58 | 10,000 square feet Dodge, 48 states 
. Commercial and Industrial 1920-25 | 10,000 square feet Dodge, 27 eastern states 
Construction Contracts 1926-28 | 10,000 square feet Dodge, 36 eastern states 
1929-55 | 10,000 square feet Dodge, 37 eastern states 
1957-58 | 10,000 square feet Dodge, 48 states 
. Number of Incorporations 1920-25 | Number Evans, 6 states 
1926-31 | Number Evans, 8 states 
1937-39 | Number Evans, 6 states 
1945-47 | Number Evans, 4 states 
1948-58 | Number Dun and Bradstreet, 48 states 
. Liabilities of Business Fail- 1920-31 | $10,000 Dun and Bradstreet 
ures 1937-39 | $:),000 Dun and Bradstreet 
1945-58 | %.9,000 Dun and Bradstreet 
. Basic Commodity Prices 1920-31 | 1,10 cent Bradstreet, 96 commodities 
1937-47 | Aug. 1939 =1,000 BLS, spot prices, 28 commodities 
1948-50 | 1947-49 = 1,000 BLS, spot prices, 22 commodities 
1953-58 | 1947-49 =1,000 BLS, spot prices, 22 commodities 
20. Index of Industrial Stock 1920-28 | Cents per share Dow Jones, 20 indus. common stocks 
Prices 1929-58 | Cents per share Barron's, 30 indus. common stocks 














BLS: U. 8. Bureau of Labor Statistics. 

FRB: Board of Governors of the Federal Reserve System. 

NICB: National Industrial Conference Board 

NBER: National Bureau of Economic Research, 

Census: U, 8. Bureau of the Census. 

Barger and Klein: Harold Barger and Lawrence Klein, unpublished estimates. 

OBE: U. 8. Department of Commerce, Office of Business Economics. 

AAR: Association of American Railroads, Car Service. 

Barger: Harold Barger, Outlay and Income in the United States, 1921-19388 (New York: NBER, 1942). 

Metrop. Life Ins. Co.: Metropolitan Life Insurance Company. 

Dodge: F. W. Dodge Corporation, Dodge Statistical Research Service. 

Evans: George Heberton Evans, Jr., Business Incorporations in the United States: 1800-1943 (New York: NBER, 
1948), Appendix 6. 

Dun and Bradstreet: Dun and Bradstreet, Inc. Prior to February 1933: R. G. Dun and Co. 

Bradstreet: The Bradstreet Co. 

Dow Jones: The Dow Jones Averages, 13th Edition (New York: Barron's Publishing Company, Inc., 1948), and 
“Barron's National Business and Financial Weekly.” 











10. 


11. 


12. 
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APPENDIX B 


SOURCES OF CURRENT DATA AND SEASONAL ADJUSTMENT FACTORS 


I, SOURCES OF CURRENT DATA 


. Nonagricultural Employment 


U. 8. Department of Labor, Bureau of Labor Statistics, Employment and Earnings, 
monthly report and advance news release, “Employment, Hours, and Earnings.” 
Seasonal adjustment by compiler. 


. Unemployment Rate 


U. 8S. Department of Commerce, Bureau of the Census, “Monthly Report on the 
Labor Force” (Current Population Reports: Labor Force, Series P-57). Number of 
unemployed as per cent of civilian labor force. Seasonal adjustment by compiler. 


. Gross National Product (quarterly) 


U. 8. Department of Commerce, Office of Business Economics, Survey of Current 
Business and advance news release, “National Income and Product.” Preliminary 
estimates are published in Economic Indicators (Council of Economic Advisers, pre- 
pared for Joint Committee on the Economic Report). Seasonal adjustment by 
compiler. 


. Index of Industrial Production 


Board of Governors of the Federal Reserve System, Federal Reserve Bulletin and 
advance release, “Business Indexes.” Seasonal adjustment by compiler. 


. Freight Carloadings 


Association of American. Railroads, Car Service Division, Revenue Freight Loaded 
and Received from Connections, weekly report. Monthly averages derived by NBER 
from weekly figures; a week ending on the Ist, 2nd, or 3rd day of the month is in- 
cluded with the weeks for the preceding month. A preliminary estimate of the 
current monthly average may be obtained by averaging the last four weekly figures. 
Seasonal adjustment by NBER. 


. Bank Debits outside New York City 


Board of Governors of the Federal Reserve System, Federal Reserve Bulletin and 
advance release, “Bank Debits to Demand Deposit Accounts.” Derived by NBER 
by deducting New York City from total for all reporting centers. Seasonal! adjust- 
ment by NBER. 


. Personal Income 


U. 8S. Department of Commerce, Office of Business Economics, Survey of Current 
Business and advance news release, “Personal Income.” Seasonal adjustment by 
compiler. 


. Sales of Retail Stores 


U. 8. Department of Commerce, Bureau of the Census. Seasonally adjusted by 
Office of Business Economics in Survey of Current Business and in Bureau of the 
Census news release, “Advance Retail Sales Report.” 


. Index of Wholesale Prices, All Commodities other than Farm Products and Foods 


U.S. Department of Labor, Bureau of Labor Statistics, Wholesale (Primary Market) 
Price Index, monthly report. As a preliminary estimate for the current month, the 
weekly index for the week closest to the middle of the month may be used. (The 
BLS compiles a weekly index based on about 10 per cent of the commodities 
quoted monthly, available in weekly news release.) No seasonal adjustment 
necessary. 

Corporate Profits after Tax (quarterly) 
Same as series 3. 

Average Workweek, Manufacturing 
Same as series 1. Seasonal adjustment by NBER. 

Layoff Rate, Manufacturing 
Same as series 1 and advance news release, “Factory Labor Turnover.” Seasonal 
adjustment by NBER. 
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13. Gross Accession Rate, Manufacturing 
Same as series 12. Seasonal adjustment by NBER. 

14. Manufacturers’ New Orders, Value, Durable Goods Industries 
U. 8. Department of Commerce, Office of Business Economics, Survey of Current 
Business and advance news release, “Manufacturers’ Sales and Inventories.” 
Seasonal adjustment by compiler. 

15. Construction Contracts Awarded, Residential Building, Floor Area 
F. W. Dodge Corporation, Dodge Statistical Research Service, Construction Con- 
tracts: United States Summary, monthly report. Seasonal adjustment by NBER. 

16. Construction Contracts Awarded, Commercial and Industrial Buildings, Floor Area 
Same as series 15. Data are sum of “Commercial Buildings” and “Manufacturing 
Buildings.” Seasonal adjustment by NBER. 

17. New Business Incorporations, Number 
Dun & Bradstreet, Inc., Dun’s Statistical Review through 1957. Press release, “New 
Business Incorporations” thereafter. Seasonal adjustment by NBER. 

18. Current Liabilities of Industrial and Commercial Failures 
Dun & Bradstreet, Inc., Dun’s Statistical Review through 1957. Press release, 
“Monthly Business Failures” thereafter. Seasonal adjustment by NBER. 

19. Index of Spot Market Prices 
U. 8. Department of Labor, Bureau of Labor Statistics, “Daily Index Numbers and 
Spot Primary Market Commodity Prices,” weekly release; also daily in New York 
Times. Data used are for the trading day on or nearest the 15th of each month. No 
seasonal adjustment necessary. 

20. Index of Industrial Stock Prices, Dow-J ones 
Barron’s National Business and Financial Weekly; also daily in Wall Street Journal. 
Monthly average of daily closing prices of thirty common stocks. As a preliminary 
estimate of the current monthly average, the average for the preceding 30 or 31 
days may be used. No seasonal adjustment necessary. 











THE ACCURACY AND STRUCTURE OF INDUSTRY 
EXPECTATIONS IN RELATION TO THOSE OF 
INDIVIDUAL FIRMS* 


RoBEertT FERBER 
University of Illinois 

In this empirical study of bimonthly labor force expectations of 
Chicago area employers, considerable disparities were obtained between 
the accuracy and structure of the expectations at the level of the firm 
and at the level of the corresponding industry aggregates. The results 
lend support to the hypothesis that the structure of individual firm 
anticipations may be wholly dissimilar to that of the aggregate. Con- 
siderable reliance on mechanical projection of level is apparent among 
the individual firms. At the same time, there is evidence that firms 
which make some effort to anticipate employment are more accurate 
than other firms, judging by naive model tests. 


HE extent to which results obtained for aggregates are also valid for the 
Tinaividual units that make up the aggregates is one of the central questions 
of empirical analysis. The question is of particular relevance to business be- 
havior because the available data are generally in the form of aggregates and 
it is desired to gauge the applicability of the results to individual firms. The 
area of businessmen’s expectations is a case in point: firms are reluctant to 
make public their own expectations so that such data are likely to be released 
only in the form of aggregates. To impute characteristics of individual firm 
expectations is in many ways a hopeless task, considering the numerous more 
or less arbitrary assumptions involved in the disaggregation process. 

The present study represents an empirical exploration of the extent to which 
findings pertaining to the accuracy and structure of individual firm expecta- 
tions compare with those that would be obtained for the corresponding industry 
aggregates. The study is made possible by the availability of monthly labor 
force expectations, and the corresponding actual figures, for Chicago area em- 
ployers on a fairly continuous basis for several postwar years. By carrying out 
the analysis first on the individual firms and then on the aggregates resulting 
when the firms are combined into industry groups, comparisons can be made 
between the two sets of findings. 


THE DATA 


Since 1946 Illinois employers have been asked by the Illinois Department of 
Labor (as part of a larger program supervised by the U. S. Bureau of Employ- 
ment Security) to report monthly the anticipated size of their labor force two 
months in advance and four months in advance as well as the actual size of the 





* This study is an outgrowth of the project on “Expectations and Business Fluctuations” financed by the Mer- 
rill Foundation and ducted at the University of Illinois under the direction of Franco Modigliani. The author 
would like to thank the University Research Board and the Bureau of Economic and Business Research of the 
University of Illinois for supplemental assistance. He is also indebted to Franco Modigliani for helpful suggestions, 
to Nai Ruenn Chen and Fadil Zuwaylif of the Bureau staff for aid with the statistical analysis, and, last, but not 
least, to Elizabeth Slotkin and Stanley Neyhaus of the Illinois Department of Labor for their assistance in securing 
the basic data. 
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labor force in the given month. The only other data requested in this report 
relate to separations from, and additions to, the payroll during the given 
month. The reports are sent out about the middle of the month, are to be re- 
turned by the end of the month, and are published bimonthly by industry 
groupings. 

Coverage. Cooperation is sought of all business firms in the Chicago-Calumet 
Labor Market Area with 65 or more employees. In terms of sales volume, em- 
ployment or other indicators of economic importance, a substantial rate of re- 
turn is achieved because special efforts are made to secure returns from the 
larger firms in each industry. 

Representativeness of replies. The extent to which the anticipations represent 
the official outlook of the firm, if such exists, and the level of operations from 
which these anticipations emanate is difficult to estimate, since no questioning 
of firms could be conducted. To judge by the title of the person to whom data 
requests are sent, it is the personnel manager or his assistant who supplies the 
data in most instances, though occasionally a production manager or a vice- 
president enter the picture. However, quite a few returns are sent in with the 
signature of someone other than the addressee, usually a person of lower rank, 
often a clerk. This does not necessarily mean that the clerk works up the data. 
In the opinion of Department of Labor analysts, the personnel manager or 
another higher official is invariably consulted in such cases. 

Meaning of anticipations. The anticipations reported by the respondents are 
not strictly forecasts in the sense of predicting what is likely to happen. Rather 
they are more in the nature of conditional expectations. Thus, the wording of 
the question has been as follows: 

Provided an adequate supply of labor will be available, what will -our total 

employment be on: 
a. (15th of month two months hence)? 
b. (15th of month four months hence)? 

In periods of labor shortages, as in the immediate postwar years or during 
and shortly after a strike, these anticipations are, in a sense, conditional inten- 
tions; they are clearly not predictions of what is expected to happen and are of 
little value insofar as gauging actual business conditions is concerned. On the 
other hand, as long as manpower supplies are adequate, the anticipations would 
seem to be equivalent to predictions of future employment. This is true for 
nearly all of the period covered by the present study, which is Fall 1951 to the 
end of 1955, the principal exceptions being isolated cases of labor stoppages. 

Data used. The bulk of the analysis is carried out for 59 principal firms 
(plants)! in 14 industries in the Chicago-Calumet Labor Market Area. These are 
predominantly large firms because relatively complete records are available 
only for such firms, and it is these firms which have the greatest impact on 
economic activity. Restriction of the study to the Chicago area was necessi- 
tated by the lack of data for other areas not to mention other considerations as 
well, e.g., efficiency of operation. At the same time, resources did not permit 











1 The two terms are used interchangeably in this study though more than one plant of some firms was included 
in the sample. Insofar as hiring and employment expectations are concerned, these plants clearly operated inde- 
pendently of each other. 
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intensive study of more than a few industries, and these were selected with a 
view toward (a) availability of almost continuous reports for individual com- 
panies for almost four years, and (b) representation in the study of industries 
differing with respect to such characteristics as seasonal variations, suscepti- 
bility to cyclical influences, and growth trends. On the basis of these considera- 
tions, the following industries were selected for study from those eligible. The 
number of firms studied in each industry is shown in parentheses; the period 
of observation is shown at the right. 


Food, excl. meat packing (4) May 1952-Jan. 1956 
Meat packing (3) March 1952-Jan. 1956 
Printing and publishing (3) March 1952-Jan. 1956 
Iron and steel (12) Nov. 1951—Jan. 1956 
Cans (2) July 1952-Jan. 1956 
Other fabricated metals (3) May 1952-Jan. 1956 
Nonelectrical machinery (6) May 1952-Jan. 1956 
Farm machinery (3) Nov. 1951—Jan. 1956 
Electrical machinery (3) Sept. 1951—Jan. 1956 
Communication equipment (6) May 1952-Jan. 1956 
Automobiles and parts (3) Nov. 1952—Jan. 1956 
Other transportation equipment (5) March 1952-Jan. 1956 
Utilities (2) March 1952-Jan. 1956 
Retail trade (4) March 1952—Jan. 1956 


COMPARATIVE ACCURACY 


There are two principal vantage points from which the accuracy of a set of 
expectations can be considered. One is their accuracy as forecasts of level, 
namely, measures of the accuracy of the anticipations themselves without re- 
gard to any benchmarks of “good” or “bad” estimates. Distinct from this type 
of evaluation is the accuracy of the anticipations as forecasts of change, an ap- 
proach which eliminates to a large extent the inherent serial correlation in 
economic data.? 

Two aspects of the anticipations as forecasts of change are considered: their 
superiority relative to forecasts of no change, and their usefulness in providing 
information on forthcoming change. In the first sense, the accuracy of the 
anticipations is compared with the approach of naive models, set up to predict 
the actual movement on the basis of some simple mechanical extrapolation 
formula. If the accuracy of the anticipations is no better than simple mechani- 
eal projection, the anticipations are not likely to be of much value for direct 
forecasting purposes, though they may still possess value in other respects. 
There remains the possibility, however, that the anticipations may nevertheless 
throw light on impending change, a possibility that is explored by means of a 
correlation approach. 


Forecasts of Level 


The average absolute per cent error of the two-month and four-month antici- 
pations is presented in Table 320 on two bases of classification. On the one base 
are presented the averages of the errors of the individual firms in the industry 





* Admittedly, these are nct the only criteria of adequacy of a set of anticipations. Another facet, which could 
not be explored in this study, is the extent to which revisions in anticipations, and employment, are sensibly related 
to disappointments in plans. 
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TABLE 320 


AVERAGE ABSOLUTE PER CENT ERROR OF 
EMPLOYERS’ ANTICIPATIONS 























Two-month Four-month 
anticipations anticipations 
Measure of error Average of | Average of | Average of | Average of 
firms’ industry firms’ industry 
errors errors errors errors 
Average absolute per cent error 4.7% 3.2% 7.0% 5.0% 
Standard deviation of errors 7.0 3.5 9.9 5.7 
Coefficient of variation 148.9 109.4 141.4 114.0 
Average absolute per cent error by 
trend of employment 
Rising trend 5.7 3.3 7.4 4.9 
Level trend 2.4 2.1 3.3 3.5 
Falling trend 8.8 6.5 14.2 9.7 











as against the average absolute error that is obtained after the individual firm 
anticipations have been aggregated. The latter is the error that would be ob- 
tained if expectations data were reported only for the industry as a whole, as 
is true for the railroad shippers’ forecasts and most other types of data. For 
the former series, the errors of the individual firms in an industry are averaged. 

The second classification is based on the direction of movement of actual 
employment in the firm or industry, as the case may be, at the time to which 
the anticipation refers. Three trends are distinguished in the movement of 
employment: rising, level and falling. The definition of these trends depends 
on the extent of seasonal variation present in actual employment for each 
industry. If the relative amplitude of the seasonal index computed for the par- 
ticular industry is small, by which we shall mean less than 10 per cent, the 
trends are defined with reference to A,/A;—2. The figure for a particular month 
is said to exhibit a falling trend if A,/A;:-2 is less than .97, a level trend if 
A,/Avs is between .97 and 1.03 inclusive, and a rising trend if A;/A:-» ex- 
ceeds 1.03. 

On the other hand, if the seasonal index for a particular industry contains 
substantial fluctuations—meaning that the relative amplitude of the series is 
10 per cent or more—the trend in employment is defined with reference to the 
change between the current month and the corresponding month of the pre- 
ceding year, i.e., A;/A,1». This is true of six industries and of 22 sample firms.* 
In these cases, a 5 per cent change was selected as the boundary point for dif- 
ferentiating between trends, the trend being a rising one if A,/A;,~12 exceeded 
1.05, falling if A;/A;z-12 was less than .95, and level otherwise. 

The rationale back of this procedure is that consideration is likely to be given 
to seasonal fluctuations in anticipating future trends only where substantial 





* Two firms exhibited no seasonal pattern in employment in contrast to sharp seasonal patterns by the other 
firms in these industries (communications equipment, and printing and publishing), and one firm exhibited a sharp 
seasonal while the other firms in that industry (nonelectrical machinery) did nut. These firms were treated sepa- 
rately from the others in each industry, with no seasonal adjustment for the first two and computation of a separate 
seasonal pattern for the third firm. 
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seasonal fluctuations are present. The distinction between “moderate” fluctua- 
tions and “substantial” ones is admittedly, and necessarily, an arbitrary one— 
as are the breaking points between the three types of trends in employment— 
but it does nevertheless serve the purpose at hand. 

A number of significant points are evident in Table 320 with regard to the ac- 
curacy of these anticipations data. Most relevant to the present discussion, 
however, is the fact that the average error of the anticipations on the industry- 
wide basis is considerably less than the corresponding average for the individual 
firms, the differential being of the order, roughly, of 40 to 50 per cent. The same 
tendency is apparent for the variability in the errors, including the relative 
variability, and when the errors are segregated by trend in the labor force. 

This phenomenon may reflect either or both of two possibilities. One is that 
the process of combining individual firm anticipations permits for greater ac- 
curacy through cancellation of opposing errors. The other possibility is that it 
may reflect the greater accuracy of the larger firms’ anticipations. Exploring 
these alternatives reveals that it is mainly the cancellation of opposing errors 
rather than superior accuracy of the larger firms’ anticipations that accounts for 
the lower error of the anticipations on an industry-wide basis.‘ 

Similar results are obtained when the same comparisons are made for each 
of the 14 industry classifications used in this study.’ Though the accuracy of 
the firms’ anticipations varies substantially by industry, the accuracy of the 
industry-wide anticipations is invariably more than that of the corresponding 
firm anticipations, particularly so in the heavy goods industries such as iron 
and steel, the machinery industries, and other transportation equipment. The 
differential is low or non-existent, on the other hand, in meat packing and 
utilities. 

It is noteworthy that industries with high average errors for individual firm 
anticipations also have high average errors of anticipations on an industry-wide 
basis. The rank correlation coefficient between the two types of errors is .91 for 
the two-month anticipations and .70 for the four-month anticipations. 


Forecasts of Change 


Naive Model Tests. The ability of the anticipations to forecast change can be 
evaluated, from a practical point of view, only when we have some standard 
against which to assess their accuracy. Essentially, what we ask is how “good” 
or how “bad” the anticipations may be relative to other means of predicting 
employment. In practice, this is likely to be the most important consideration, 
for if the anticipations provide the best available means of forecasting employ- 
ment they may be a highly useful set of data irrespective of the closeness with 
which they approximate actual employment. On the other hand, if the anticipa- 
tions are less accurate than some mechanical, or “naive,” forecasting approach, 
they may not possess much practical value even if they prove to be very close 
to actual employment. 

Two “naive models” are used to test the external accuracy of the anticipa- 
tions. One is the simple projection of the level of the current labor force figure; 





* The full results are contained in a forthcoming monograph to be published by the Bureau of Economic and 
Business Research of the University of Illinois. 
5 Ibid. 
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the other is the projection of the current figure modified by the most recent rate 
of change in employment. The actual formulas used depend on the extent to 
which seasonal variation is present in the employment data. If relatively small 
seasonal variations are present in the data, the two “naive” projections for 
month ¢ can be represented as follows for the two-month anticipations: 


E + = Ay 
and 


’ At 
Kese = A, 3 


Avs 





Similar expressions apply for the four-month anticipations. 

Where seasonal variation is present, two procedures can be followed. One is 
to use the above expressions after adjusting the employment figures for seasonal 
variations. The other, which is used here, is to make implicit adjustment for 
seasonal variation by extrapolating the employment figure for the correspond- 
ing month of the preceding year, with or without allowance for change in em- 
ployment during the intervening period.’ By this procedure, the two naive 
models for the two-month anticipations becomes:’ 
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and for the four-month anticipations: 





Aus 


t 
A t—12 


ee Ars 1 ¢ Ae 
naomi Lf 
Att 3 (Arie 


The extent to which the labor force anticipations are more accurate than 
these naive models is shown in Table 323 for firms and for industries. The firms 
are coded in the table in descending order by size of employment, e.g., Al is 
the largest employer in the sample from the food industry and A4 is the small- 
est. The table indicates that 30 to 40 per cent of the firms (24 or 18 out of 59, 


Eis = A 














* A general-purpose model could be set up by using seasonal coefficients, as follows: 
Buse = KyAp(Sp42/8,) + Ko Ayo(S:/S;-2), 


with Ki and K;: being predetermined constants. 

This approach is not particularly “naive,” however, besides raising the problem of estimating seasonal coef- 
ficients, and was not used here. 

7 Ap-so/Acia in B’ ye and Ee, is essentially an adjustment of the change in seasonal pattern from month ¢ 
to +2, on the assumption of a constant year-to-year seasonal. The same is true af A¢_s/A¢g_i2 for the four-month 
anticipations. 








TABLE 323 


ACCURACY OF LABOR FORCE ANTICIPATIONS RELATIVE TO 
TWO NAIVE MODELS 


























Proportion of times | Proportion of times 
Nai accuracy of two- accuracy of four- 
Sedinites Firmt tr month anticipations | month anticipations 
ae on > “oe equals or exceeds equals or exceeds 
Eu: Euya”’ E, ae’ | Bias 
Food, excl. meat Al 1 43% | 80% 19% 53% 
packing A2 1 62 | §2 57 74* 
A3 1 36 38 43 79* 
A4 1 100* 76* 100 74* 
Industry l 52 65 38 68 
Meat packing Bl 1 55 83* 74* 88* 
B2 1 55 59 76* 56 
B3 1 90* 67 85* 84* 
Industry 1 65 88* 63 79* 
Printing and Cl 1 65 53 50 63 
publishing C2 2 41 41 44 44 
C3 2 81* 88* 60 60 
Industry 2 56 56 50 50 
Iron and steel ee Le 39 48 41 26 
D2 1 70* 48 82* 63 
D3 1 39 38 36 37 
D4 1 65 50 68* 39 
D5 1 52 62 64 79* 
D6 1 43 52 50 53 
D? 1 56 48 64 58 
D8 1 78* 83* 81* 67 
D9 1 60 67* 75* 64 
D10 1 100* 52 100* 75* 
Di1 1 86* 55 80* 61 
| D12 1 80* 71* 58 50 
| Industry | 1 52 40 55 61 
Cans EI 2 53 53 57 64 
E2 2 47 47 29 36 
| Industry 2 53 53 43 50 
| 
Other fabric | FI 1 70* 95* 68 71* 
metals | F2 I 68* Ff ne 52 74* 
F3 1 30 47 53 65 
Industry 1 65 79* 62 63 
Nonelectrical Gl 1 45 57 48 74* 
machinery G2 2 91* 82* 60 70 
G3 1 Fig 85* 75* 94* 
G4 1 85* 63 89* 59 
G5 1 86* 62 86* 63 
G6 1 71* 65 68 72* 
Industry 1 63 46 50 69 























* Significantly above 50% at .05 probability level. 
t Italicized firms are those preparing anticipations as simple mechanical projection of the current level of 
employment. 
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TABLE 3: Fa —-pontinned 














Proportion of times 
ic. accuracy of two- 
S Naive month anticipations 
Industry Firm ake equals or exceeds 
1se¢ 
_ ae Ess 
Farm machinery H1 2 43 42 
H2 2 71 64 
H3 2 80* 7" 
Industry 2 54 67 
Electrical Il 1 62 36 
machinery I2 1 62 60 
I3 1 61 64 
Industry 1 65 55 
Communications Ji 2 75* 69 
eqt. J2 2 75* 63 
J3 2 75* 75* 
J4 1 43 52 
J5 2 75* 75* 
J6 2 47 t3a° 
Industry 2 73* 73* 
Autos and parts Ki 1 26 44 
K2 1 100* 89* 
K3 1 61 59 
Industry 1 32 50 
Other transp. Ll 2 94* 88* 
eqt. L2 2 55 60 
L3 2 41 41 
L4 2 47 71 
Lé 2 94* 88* 
Industry 2 80* 90* 
Utilities M1 1 90* 74* 
M2 1 43 41 
Industry 1 35 47 
Retail trade Nl 2 29 35 
N2 2 67 53 
N3 2 53 56 
N4 2 53 47 
Industry 2 44 56 

















Proportion of times 
accuracy of four- 
month anticipations 
equals or exceeds 





Buss’ Bu44"’ 
50 60 
50 78* 
7 50 
75* 67 
68* 57 
56 57 
33 68 
73° 50 
73° 67 
80* 80* 
87* 100* 
32 53 
73* 80* 
71 62 
64 71 
44 56 
100* 75* 
41 75* 
35 50 
88* 94* 
78 71 
69 56 
56 69 
81* 93* 
100* 100* 
95* 76* 
68* 80* 
44 69 
63 69 
71 79* 
63 73 
50 50 
63 8i* 








t For industries with slight seasonal variation the two-month model is: 


Buy2’ = Ag; 


At 
D Ay — 





Buse" 


The four-month model is the same except that A¢_s is replaced by A; 4. 
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| variation, the two month model is: 
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In the four-month model, A:_s replaces A¢19 and } replaces 4. 
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depending on the naive model used) were able to anticipate employm<nt sig- 
nificantly better than the naive models two months ahead. Similar results are 
obtained for the four-month anticipations. On an industry basis, the anticipa- 
tions did not do as well, proving significantly superior to the naive models for 
only two or four industries, depending on the naive model used. 

The overall average of the proportion of times the anticipations of all firms 
are more accurate than each naive model, and the overall average of the propor- 
tion of times the anticipations for all 14 industries are superior to each naive 
model, are both in the neighborhood of 60 per cent. 

The extent to which relative accuracy is an industry phenomenon is not too 
clear from these data, for the firms whose anticipations proved generally more 
accurate than the naive models are fairly well scattered among the different 
industries. More pertinent seem to be the interrelations between relative ac- 
curacy, extent of seasonality in employment and the method apparently used 
by the firm to prepare the anticipations. Although the individual firms could 
not be contacted, it was obvious from the data that many of them were taking 
as their anticipation the current level of their employment, i.e., E.42= A, oc- 
casionally rounded off to the nearest 0 or 5, or generally carrying forward the 
previous anticipation; the code letters of these firms are italicized in Table 323. 
It is significant that these firms are heavily concentrated in industries charac- 
terized by little seasonal variation in employment; this is true of 19 of these 
27 firms. To take a different perspective, 53 per cent of all firms in industries 
with little seasonal variation resorted to mechanical projection as compared 
with 35 per cent of firms in the other industries. Moreover, the accuracy of the 
anticipations varies noticeably with these two factors, as is brought out by the 
following tabulation, showing the proportion of firms in each category whose 
two-month anticipations outperform the naive model more than half the time: 




















Model I 
Mechanical projection Other means 
at , Substantial seasonal 50% 79% 
Little seasonal 84 61 
Sat caticen aaa : Model IT 





Mechanical projection Other means 














Substantial seasonal 63% 
Little seasonal 84 


79% 
72 











The results for the four-month anticipations are essentially the same. 

Thus, it appears that, on the one hand, firms in industries characterized by 
little seasonal variation were more likely to utilize simple mechanical projection 
in their forecasts and, on the other hand, these forecasts were substantially 
better than other means only in these industries. This is not unexpected, par- 
ticularly since during the period studied amplitude of seasonal variation of em- 
ployment was a dominant factor in total amplitude of employment.* Hence, 





§ R. Ferber, Op. cit. 
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mechanical projection is likely to possess high relative accuracy where seasonal 
variation is relatively low and is likely to yield poor results in other situations. 

Overall, Table 323 indicates that the anticipations of firms relying on methods 
other than simple mechanical projection turn out only slightly better than those 
of firms relying on mechanical projection. However, when a similar analysis is 
carried out taking into account the magnitudes of individual errors, the margin 
of superiority increases substantially.* Thus, it appears that these other meth- 
ods are most successful primarily in forestalling relatively large errors rather 
than in terms of the frequency of relative accuracy. 

Correlation Tests. If the anticipations contain some consistent bias, the ac- 
curacy of the anticipations in gauging change can be improved by correlating 
the actual and anticipated rates of change. If little seasonal variation is present 
in the data, this can be done by correlating A12/A, with E,,2/A, for the two- 
month anticipations, and A144/A, with £,,4/A;, for the four-month anticipa- 
tions. However, this is not a desirable procedure if pronounced seasonal varia- 
tion exists in the data, for the common influence of the seasonal on both the 
actual and the anticipated data tends to produce spuriously high evidence of 
accuracy. Hence, for the six industries and one other firm in the sample where 
pronounced seasonal variation was detected, the data were adjusted for sea- 
sonal (using the link-relative method) before correlating the actual and antici- 
pated rates of change with each other. 

The results of carrying out these correlations for the individual firms and the 
employment totals of each industry are presented in Table 327 for both sets of 
anticipations. The coefficients of determination between the actual and antici- 
pated rates of change for the different firms in an industry are shown in an 
adjoining column but on the same line as the coefficient for the industry. As 
before, the coefficients for the firms are presented in order of decreasing em- 
ployment, so that figures in corresponding positions of the two “individual 
firms” columns pertain to the same firm and are comparable with data for the 
same firms in Table 323. 

The fundamental difference between the naive model approach used previ- 
ously and the correlation test lies in the fact that in the first instance the 
anticipated change is used as reported, whereas the correlation test compares 
the actual change with the “best” linear function of the anticipations, “best” 
being defined in a least squares sense. It is because of this allowance by the 
correlation process for linear bias that the naive model tests and the correlation 
approach may yield different results. 

The general picture presented by Table 327 is one of poor correspondence be- 
tween anticipated empioyment and actual employment. At the industry level, 
a significant correlation between the two sets of data is apparent for only seven 
of the fourteen industries for the two-month anticipations and five of the 
fourteen industries for the four-month anticipations. The latter five industries 
are included in the former: a significant four-month correlation is accompanied 
by a significant two-month correlation. On the whole, the two-month anticipa- 





» Ibid, 
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TABLE 327 
CORRELATION BETWEEN ACTUAL AND ANTICIPATED RATES OF 
CHANGE IN LABOR FORCE, FOR INDUSTRIES AND 
INDIVIDUAL FIRMS 








Coefficient of determination between actual |Coefficient of determination between actual 
and anticipated change two months ahead | and anticipated change four months ahead 





Industry Individual firms | Industry | Individual firms 





No seasonal adjustment 





Food, excl. meat packing d .38, 40**, .09, .00 d .08, .35**, .10, .00 

Meat packing 19, .40**, .08 13, .42™, .65** 

Iron and steel ‘ -08, .06, .10, .01, (—).10, .00, ‘ .02, .02, .09, .03, (—).07, .02, 
09, .42%*, .00, (—).06, .24* .00, .23*, .00, (—).09, .28*, 
.25** 29% 

Other fabricated metals 4 51, 35, .14 F .06, .21*, .08 

Nonelectrical machinery ‘ 42**, (—).04, 59, .39**, - .31**, .09, 69%, 447%, .3s™ 
39", .09 13 

Electrical machinery F 40**, 49**, .21* d .58**, .45**, .17* 
Automobiles and parts } .35**, .00, .10 . -26*, .00, .25* 

Utilities d 17, .61°* ‘ 45%, 46°* 





Seasonally adjusted 





Printing and publishing r 61", .07, .17 J .64**, .08, .25* 

Cans . .25*, .01 e A2**, 06 

Farm machinery * .00, .09, .08 J 20, .06, .04 
Communications eqt. é .27*, .45**, .25*, .02, .08, .08 : 48", 56°, 36", (—).07, 
06, .06 

Other transp. eqt. - 48**, .56**, .66, .19*, .66** d .22*, 43°, .61, .25%, 63° 
Retail trade ‘ .03, .34**, .16, .20* (—).01 .20, .26*, .17, .69** 


total 

















Note: One asterisk indicates significance at the .05 probability level; owe, at the .01 level. 
Italicised figures represent firms using simple h 1 proj 
Negative signs in parentheses preceding a coefficient indicate negative correlation. 





tions gauge the actual change in employment better than the four-month 
anticipations, as one would expect.'® 

The ability of the individual firms to anticipate correctly the change in actual 
employment varies widely. Roughly half of the firms in the sample were able 
to anticipate the change in their employment more closely than would be an- 
ticipated as a result of chance. Most of these firms, however, were in one of five 
industries—other fabricated metals, nonelectrical machinery, electrical ma- 
chinery, communications equipment, and other transportation equipment 
With the exception of communications equipment, the anticipations of move 
than half the firms in each of these industries were correlated significantly with 
actual changes in employment, for both the two-month and the four-month 
anticipations. Interestingly enough, each of these industries turns out pro- 
ducers’ durable goods. 





” An alternate approach, suggested by a reviewer, is to incorporate a seasonal variable in the correlation, 

puting the regression of the actual change in employment on the anticipated change and on the ratio of seasonal 
coefficients for the months in question. This procedure makes possible an assessment whether the anticipations have 
any net predictive value over and above the seasonal factor. Exploratory calculations using this technique serve 
to reinf the inf drawn from Tabie 3. The net correlations between the actual and aaticipated changes 
are somewhat lower than the gross changes, particularly for firms and industries with pronounced seasonal variation, 
but generally remain statistically significant if they originally were so. 
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As a general rule, firms that are able to anticipate changes in employment 
better than chance two months ahead are able to do the same four-months 
ahead. In fact, slightly more firms were able to anticipate successfully on this 
basis four months ahead than two months ahead! Of the 59 firms shown in 
Table 327, 30 sets of four-month anticipations were correlated significantly with 
actual changes in employment as compared with 28 sets of two-month anticipa- 
tions; 26 firms were successful on both bases. In most cases, however, the close- 
ness of the relationship is not at all high. 

On the whole, significant correlations on an industry basis tended to occur 
where the actual and anticipated rates of change were correlated significantly 
for half or more of the firms in the industry. Mathematically, there need be no 
relationship between these two factors, and in fact, Table 327 reveals some no- 
table exceptions. Thus, the two-month anticipations of only three of the twelve 
iron and steel firms—and all three were relatively small, were significantly re- 
lated to actual change but the correlation between the aggregated magnitudes 
was statistically significant. On the other hand, although four of the six non- 
electrical machinery producers were able to anticipate changes in their employ- 
ment better than chance four months ahead as well as two months ahead, the 
industry magnitudes showed no correspondence on either base." 

Easily the most important distinguishing characteristic of the successful 
firms was the method used to prepare the anticipations. Of the firms using one 
of the two forms of mechanical projection, only 5 of 27, or 19 per cent, antici- 
pated two-month changes in employment better than chance whereas of the 32 
firms using some other method, 23, or 72 per cent, were “successful” in this re- 
spect. In contrast to the accuracy of the level of the anticipations, the method 
used is clearly a fairly sensitive indicator of the ability of the firms to anticipate 
rates of change. 

Direction and Turning Point Tests. Considerable care must be exercised in 
comparing direction of change lest the results be influenced by the same serial 
correlation effects we are trying to avoid. A purely mechanical approach of pro- 
jecting into the future the same direction of movement in employment as oc- 
curred in the recent past would have been correct, in this study, more than half 
of the time purely because of this serial correlation effect.” 

To abstract from this difficulty, comparisons are shown in Table 329 not only 
on the basis of ability to anticipate correctly increases or decreases in employ- 
ment but also on the basis of ability to anticipate correctly reversals in employ- 
ment. Table 329 presents four statistics for each firm in the sample as well as for 
the cumulated industry anticipations. These are the proportion of times an in- 
crease in employment was correctly anticipated, the proportion of declines 
correctly anticipated, and also the corresponding percentages for reversals in 
employment—upturns and downturns. Reversals are defined as changes in the 
direction of movement of employment from the most recent movement, after 
adjustment for seasonal if any, and are therefore included in the first two com- 





1 It might be noted at this point that substantial differences were apparent between industries in the average 
variability of actual employment from month-to-month, even after seasonal adjustment—the range for industries 
being from less than one per cent for utilities to over 6 per cent for meat packing. However, the correlations in 
Table 3 showed little overall! relationship to these differences. 

12 Based on tabulations of the length of runs in the direction of change for the 14 industry totals. The actual 
percentages varied between 45 and 70%, with most between 50 and 60%. 
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TABLE 329 


ACCURACY OF TWO-MONTH ANTICIPATIONS IN GAUGING THE 
DIRECTION OF MOVEMENT OF EMPLOYMENT, BY 
FIRM AND BY INDUSTRY 








Industry 


Proportion of times 
direction anticipated 
correctly on 


Proportion of times 
reversals gauged 
correctly on 





Increases Decreases 


Upturns | Downturns 








Food, excluding meat Al 


packing 


A2 
A3 
Ak 


82*% 
67 
100 
0 


82*% 
69 

0 

0 


86% 
100*t 
100* 
.00 


86% 

75T 
.00 
-00 


Industry 





Meat packing Bi 
B2 
B38 

| Industry 


Printing and Cl 
Publishing 1. ee 
C3 


| Industry 


Iron and steel D1 
D2 
D3 
Dé 
D5 
D6 
D? 
D8 
Dg 
D10 
Di1 
D12 
Industry 

El 

E2 


Industry 


Other fabricated Fl 
metals F2 
F3 
Industry 











50 
100* 
15 


83* 
85* 
67 








100* 80 


40 
20 
.50T 
60 


100* 
57 
100* 
57 


Ot 
50 
100* 


60 
67T 
80 





80 
75t 
00* 


100*t 


100*| 





* Significantly above .50 at .05 probability level. 


t Based on less than five observations. 


* Italics indicate firms using mechanical projection. 
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TABLE 329—(continued) 











Proportion of times Proportion of times 
direction anticipated reversals gauged 
Industry correctly on correctly on 





Increases | Decreases Upturns | Downturns 





Nonelectrical Gl 100* 100*t Ot 
machinery G2 Ot 100¢ 
G3 94* 67t 
G4 86 507 
Gé 67 50t 
G6 75 29 25t 

Industry 


Farm machinery H1 OT j 3¢ 50t 
He Ot ; 100* 
H8 33t 80 

Industry 





Electrical machinery Ii 100* 5 67 
I2 | 100* ot 
I3 100* 
| Industry | 








Communications a 71 
equipment J2 88* 


100* 


Industry 


Autos and parts Kl 
Ke 

K3 

| Industry | 





Other transportation | Li 
equipment L2 
L3 
L4 
Lé 
Industry 


Utilities M1 
M2 
Industry 


Retail trade Nl 

N2 

N3 

N4 
Industry 




















* Significantly above .50 at .05 probability level. 
t Based on less than five observations. 
* Italics indicate firms using mechanical projection. 
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parisons as well. As in Tables 323 and 327, the firms in each industry are listed 
in order of decreasing size. As before, code letters of firms relying on simple 
mechanical projection are italicized. 

The asterisks in the table indicate which of these percentages significantly 
exceeds 50 per cent at the .05 probability level or beyond, thus indicating the 
extent to which the anticipations are superior to the accuracy that might have 
been obtained as a result of chance. It is important to stress, however, that this 
interpretation is subject to qualification with regard to the anticipation of in- 
creases and decreases because of the serial nature of these phenomena. 

In an aggregate sense, the results in Table 329 can be summarized in the fol- 
owing three lines: 








Decrease | Upturn | Downturn 





Number of firms with “‘success” ratios 
significantly above .50 ; 5 20 5 
Per cent of all firms 8% 8% 
Number of industries with “success” 

ratios significantly above .50 | 6 0 7 1 

















It is clear from these data that at best only one-third of the firms were suc- 
cessful in anticipating increases or upturns in employment better than what 
would be expected as a result of chance, and that only a small fraction of the 
firms were successful in this regard when it came to declines or downturns in 
employment. At the same time, when the firm anticipations are aggregated, 
roughly half of the industry anticipations successfully predict increases or up- 
turns in employment better than chance but almost none of the industry an- 
ticipations did so when employment moved in the other direction. 

Table 329 indicates that in general a negative correlation exists between a 
firm’s ability to anticipate upward movements and its ability to anticipate 
downward movements. Thus, 23 firms were able to anticipate either increases 
or decreases in employment better than chance, but only three were successful 
in both respects! The same is true of upturns and downturns. This rather re- 
markable characteristic would seem to indicate that the firms with the more 
successful anticipations in this regard are not necessarily the more astute but 
rather are those forecasting most consistently in a particular direction! If up- 
ward movements are combined with downward movements, we find that only 
three of 59 firms were able to anticipate the direction of change significantly 
better than chance. 

As before, the method of anticipation was important, as shown by the 
following tabulation: 








Increase Upturn | Downturn 





Proportion of firms using mechanical 


projection and anticipating direction 
better than chance 3/27 
Proportion of firms using other methods 
and anticipating direction better than 
chance 18/32 
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STRUCTURE OF THE ANTICIPATIONS 


The extent to which anticipations can in turn be anticipated is a central issue 
in any study of such data, for if they can be anticipated with a high degree of 
precision their net value for analytical purposes is likely to be very small. In 
at least one study, this has proved to be the case."* An important corollary 
question which has hardly been touched, however, is the extent to which hy- 
potheses explaining industry anticipations are equally successful in explaining 
the anticipations of the individual firms that comprise the aggregate. Does the 
hypothesis explaining industry anticipations typify individual firms as well, or 
is it in effect some hybrid phenomenon representing a conglomeration of diverse 
structures of firm anticipations but representing none of them? 

In the analysis of the Illinois employer’s labor force anticipations, six differ- 
ent hypotheses were advanced to explain the structure of the anticipations, as 
shown in Table 333." Each of these hypotheses was applied to the anticipations 
of the appropriate firm and industry total in turn, according to the seasonal 
amplitude of the data, by means of a series of reliability tests ending in multiple 
regression analysis. The results indicated the extent to which each hypothesis 
was successful in explaining the structure of the anticipations of particular 
firms and industries. We are therefore able to compare the extent to which a 
hypothesis successfully explaining industry anticipations is equally successful 
for the individual firms in the industry. A summary comparison of this type is 
shown in Table 334. 

Two findings relative to the diverse nature cf the results are evident in 
Table 334. One is that considerable variation exists with regard to which hy- 
pothesis best explains the structure of a particular industry’s anticipations, at 
least of those considered here. The only hypothesis which appears with any 
regularity is F, the mechanical extrapolation hypothesis. This result may not 
be unexpected considering the substantial differences in seasonal variation be- 
tween employment in different industries. 

The other finding, of more relevance to the subject at hand, is that a hy- 
pothesis which successfully explains the structure of anticipations of a par- 
ticular industry seems as likely as not to represent the structure of anticipations 





3 Ferber, Robert, The Railroad Shippers’ Forecasts, Chapters 4-5. This is especially so if lack of predictive 
value is also demonstrated for the residuals from the regression of the anticipations on past experience, as was the 
case with the railroad shippers’ forecasts. 

4 A discussion of the theoretical basis for each of the hypotheses is contained in the forthcoming monograph 
on this subject to be published by the Bureau of Economic and Business Research of the University of Illinois. It 
might be noted that these hypotheses cover a wider range of possibilities than may be evident at first sight, for many 
of the more complex hypotheses are but extensions of one of these. If the simpler form is not supported by statis- 
tical tests, more complex forms can also generally be safely disregarded. 

% To reduce the t of calculation, only those hypotheses passing certain “elimination tests” were sub- 
jected to regression analysis for each firm and industry. Thus, for (A) to reach the multiple regression stage, it had 
to satisfy the following two conditions: (1) the correlation between E4,:— A; and A;—A¢_s should be statistically 
significant at the .05 probability level; (2) the value of r* in the preceding case should exceed the corresponding 
correlation between A¢,2—Azand A;—Az_-+ by at least .05. The latter condtition is somewhat arbitrary, necessitated 
by the failure of the usual significance tests to discriminate between the two correlations, apparently because of 
the small sample sizes involved. For each firm and industry total in which hypothesis A satisfied these two condi- 
tions, the operational form of A was estimated by least squares using Zz,2 and A¢,: as alternate dependent variables. 
If the least square form of (1) with 2:4: dependent survived certain additional teste—primarily similar forms of the 
above two and that the estimate of ¢ be statistically significant while the estimate of b not differ significantly from 
1.0—the hypothesis was accepted for that particular firm or industry. Note that if any of the first five hypotheses 
satisfy all the requirements but c is not significantly different from sero, we are eutomatically left with hypothesis 
F. A discussion of these tests is included in the p tioned in the preceding f 
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of a minority of firms in the industry. In only four of the 13 instances in which 
a hypothesis successfully explained the industry structure of anticipations was 
it equally successful in explaining the structure of anticipations of more than 
half of the firms in the industry (though in one other industry just half were 
explained—one out of two). Furthermore, the degree to which a hypothesis is 
successful in explaining industry structure, as measured by the coefficient of 
determination, does not seem to provide much indication of the extent to which 
the same hypothesis explains the anticipations of the firms in tlie industry. 
Thus, in three of the six instances in which a hypothesis explained more than 
90 per cent of the variation in industry anticipations the same hypothesis was 
acceptable for the anticipations of a minority of the firms in the industry; and 


TABLE 333 
ALTERNATIVE HYPOTHESES ON STRUCTURE OF ANTICIPATIONS 








Amplitude of ] 
ee a . ere Hypothesis t Operational formf 
variations* 





Slight (A) Ait+(A:—Atz_2) at+bAi+cAt_s 
btthe. Ae 
Slight (B) A. (i+ raed) a+bArte — 


Ais At-2 


Substantial (C) Ad’ +(A;’ —A?’_2) a+bA;’+cA:_2’ 


A,’ —A;?'_ A,'’2 
Substantial (D) A,’ (1 For Bom =) a+bA;'+c ye 


At?’ t’-2 


A,—Aiu_ A 
Both (E) Avs (1+ ~ wane *) at+bAi_10+cAt_i0 ‘A - 


Ai 42 t-12 


Both | (F) A: a+bA, 











* Based on the derivation of seasonal indices by the link-relative method, the boundary between “slight” and 
“substantial” amplitude being 10 per cent variation from trough to peak. 
t Dependent variable in all instances is F;,». 
Notation: A¢—», denotes actual labor force n months ago. 
E142 denotes anticipation two months ahead, prepared in month ¢. 
Primes denote seasonally-adjusted values. 


the hypothesis with the lowest industry coefficient of determination (iron and 
steel) proved successful on seven of the 12 firms in the industry, producing 
coefficients of determination for those seven firms ranging between .90 and 
.998.'* 

The indifferent success of the acceptable industry hypotheses with the indi- 
vidual firm anticipations points to the very real possibility that industry an- 
ticipations represent at times a conglomeration of diverse structures of indi- 
vidual firm anticipations, as noted in the opening paragraph of this section. 
This is borne out by further examination of Table 334, which reveals considerable 
disparity in the structure of the anticipations of the different firms in the same 
industry; indeed, in this respect there is at least as much variability within 
most industries as between them. Whether or not a particular hypothesis is 





% The three instances where a successful industry hypothesis also explains the anticipations of most or all of 
the firms in the industry all involve the simplest hypothesis, F. Therefore, the implication of the fact that the esti- 
mates of the slope, 6, for the firms are close to the corresponding estimate of b for the industry remains a moot point. 
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TABLE 334 


SUCCESS OF HYPOTHESES IN EXPLAINING THE STRUCTURE OF THE 
ANTICIPATIONS OF INDUSTRIES AND FIRMS 








Hypothesis acceptable Other hypotheses 
for industry total acceptable for firms 
in given industry 





Industry Proportion of 

Hypoth- R: firms in industry 
esis for which 

hypothesis holds 





Hypoth- Number 
esis of firms 





Food, excl. meat packing , : 1 of 4 1 of 4 
0 of 4 


Meat packing 1 of 3 
Printing and publishing -- 1 of 3 
Iron and steel . 7 of 12 — 

Cans a 1 of 2 
Other fabricated metals ° 9% 1 of 2 
Nonelectrical machinery ; 5 of 5 
Farm machinery .§ 3 of 3 
Electrical machinery ; 1 of 3 
Communications eqt. ‘ 1 of 5 


Autos and parts ‘ 2 of 3 
‘ 1 of 3 
Other transp. equipment : 1 of 5 
‘ 1 of 5 
Utilities — 

Retail trade . 0 of 4 














applicable to both industries and firm anticipations cannot be deduced from 
results pertaining to the former alone, even apparently if well over 90 per cent 
of the variation in industry anticipations is explained by that hypothesis. 


IMPLICATIONS OF RESULTS 


This study reveals considerable disparities between the nature of results ob- 
tained at the individual firm and industry levels with regard to the accuracy 
and structure of businessmen’s anticipations. The anticipations are clearly 
more accurate at the industry level, a superiority which is attributable in the 
main not to equally high accuracy of individual firm anticipations but rather 
to compensating errors of anticipation by the individual firms. Similar results 
have been obtained in the analysis of capital expenditure plans.'’ 

Also of considerable interest is the finding that very few firms were able to 
forecast employment consistently better than chance tn both directions. Firms 
with the most accurate prediction records were generally those which pre- 
dicted changes consistently in the direction which coincided with the trend of 
their employment (generally upward) ; a change in employment counter to this 
trend was invariably missed. At the same time there is evidence that firms 


17 See papers by Keeser and by Foss and Natrella in forthooming proceedings of Conference on Anticipatory 
Data, held by the Universities-National Bureau Committee at Princeton, Nov. 8-9, 1957. 
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which make some effort to anticipate employment, i.e., firms that do not con- 
sistently mechanically extrapolate current levels, are more accurate than other 
firms on the basis of tests of relative accuracy.'* In an absolute sense, such firms 
do show larger errors of the forecasts, but only because of a tendency for 
mechanical extrapolation to be used more frequently in industries characterized 
by relatively low fluctuations in employment during the period studied. 

This study also provides corroborative evidence for a hypothesis advanced in 
connection with the railroad shippers’ forecasts to the effect that the structure 
of individual firm anticipations may be wholly dissimilar to that of the aggre- 
gate; and that the latter represents an average of two opposite types of indi- 
vidual anticipations—mechanical extrapolation of level and relatively accurate 
non-mechanical forecasts.'® The findings of this study point to the existence of 
several, rather than two, different types of forecasts insofar as structure is 
concerned, with mechanical extrapolation of level as the predominant type. 
With a few exceptions, these different types intermingle in each industry so 
that the industry-derived structure of the anticipations is likely to bear little 
relation to that of most of the individual firms. As the level of aggregation rises 
to a regional and national scale, the dissimilarity between aggregate and indi- 
vidual-firm structure will undoubtedly increase. 

From a more general point of view, this study serves to highlight a basic 
difficulty of these and other types of anticipations solicited from the outside. 
If the particular data requested are germane to the firm’s plans and operations, 
serious attempts at accurate forecasts (resulting in, apparently, more successful 


forecasts) can be expected. If the data appear to be external to the firm’s opera- 
tions, there is likely to be an inherent inclination to comply with the request 
with the least effort. The problem of obtaining useful expectations data, there- 
fore, entails not only the construction of a meaningful set of concepts from an 
analytical point of view but also of fitting these concepts into the operational 
framework of the firm. 





8 This is corroborated by a similar study of the Indiana labor force anticipations supplemented by personal 
interviews conducted by Professor William Andrews of Indiana University. Firms that were considered to be making 
some effort to anticipate employment were generally much more accurate than other firms. 

19 Hearings Before the Subcommittee on Economic Statistics of the Joint Committee on the Economie Report, 
Report of Federal Reserve Consultant Committee on General Businese Expectations, 1955, pp. 670-571. 











AN INDEX OF MANUFACTURING PRODUCTION 
IN NEW ENGLAND* 


Harry BENJAMIN ERNST 
Tufts University 


Part of the current production estimates of the Federal Reserve 
Board are based on projected and interpolated productivity relation- 
ships which are two or three years old. An alternative method is out- 
lined which is in three parts. 

1. Annual relationships are estimated for kilowatt hours, production 
worker man-hours and output, based on plant data obtained by 
mail survey. 

2. Man-hours and KWH are collected by the 20th of each month for 
specific two and three digit Standard Industrial Classification In- 
dustries. 

3. The annual productivity relationships are applied to the monthly 
man-hours and KWH in making current, monthly production 
estimates for specific industries. 

There is still a lag in obtaining annual relationships, but the monthly 
KWH and man-hour data are helpful indicators of “scale” or “tech- 
nological” changes which affect productivity. Illustrative but prelimi- 
nary total manufacturing indices for New England are presented. 


n any highly industrialized region or economy, current measures of manu- 

facturing output are of great value. Manufacturing output is significant 
both because it is a large part of total output of goods and services and because 
of its interrelations with other sectors of the economy. Until 1955, employment 
data were the chief source of manufacturing information in New England. Ex- 
cept for very short time periods, total employment data are unsatisfactory indi- 
cators of industrial output because output also depends on the number of hours 
worked and on the technological and organizational structure of the industries 
involved. It is true that an activity index could have been assembled from the 
Bureau of the Census, “Boot and Shoe Production,” “Cotton Spindle Activity” 
or other series for New England. However, there are only a few such indicators 
and no broad regional measure of output comparable to the Federal Reserve 
Board’s Index of Industrial Production. In view of this, the Federal Reserve 
Bank of Boston has been financing research relating to detailed measures of 
the region’s manufacturing output. The basic research is being carried out at 
the Institute for Economic Research, Tufts University. 





* This paper explains procedures used in constructing the New England Indices of Manufacturing Production 
at the Institute for Economic Research, Tufts University, under a research grant from the Federal Reserve Bank 
of Boston. The paper describes the problems and general methodology of the research. Only preliminary, very 
crude results are presented. The indices are being prepared for the New England region as a unit, and for the states 
of Maine and Massachusetts separately. Much credit should go to staff members Daniel Ounjian, Gerald B. Dahill, 
Allan Gard and Ira Stepanian; also to Lewis F. Manly of Tufts University; George H. Ellis, Director of Research, 
Federal Reserve Bank of Boston; Wendell D. MacDonald, Director, and Edward T. O’Donnell ui the Boston Office, 
Bureau of Labor Statistics; Joseph F. King, Director, Massachusetts Department of Labor and Industries; R. Leigh 
Fitzgerald of the New England Electric System and William J. Ray of the Electric Council of New England, as 
well as others too numerous to mention. 
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Since the FRB model is closest to the new model, a brief description of some 
aspects of the FRB methods follows. About fifty per cent of the FRB monthly 
output indexes are estimated from changes in the production worker man- 
hours reported by the U.S.Bureau of Labor Statistics. Chart 338, reproduced 
with the permission of the Federal Reserve Board, Division of Research and 
Statistics, illustrates the method for a hypothetical industry. Essentially, the 
relationship between output and man-hours is determined annually, and then 
nterpolated ‘n the form of a smooth curve within given years. Extrapolation 
is also necessary since annual data lag current monthly man-hour inputs by 
two or three years. Annual relationships are applied to monthly variables in 
both the interpolative and extrapolative procedures. One must assume that 
the relationships between output and man-hours change smoothly within years 
and the relationships may be extrapolated smoothly through time.' As an alter- 
native to the FBR procedures, there is the model described below. 


I. THE CONCEPTUAL MODEL FOR THE NEW INDEX OF 
MANUFACTURING PRODUCTION 


As with the FRB index, annual-monthly relationships are used in the new 
model, but with essential differences in the basic units of measurement. 
Monthly production worker man-hours are supplemented with kilowatt hours 
for specific industries. Annual man-hour and kilowatt hour input data are used 
plus deflated value of production, (because physical output data are not avail- 
able either monthly or annually). The two monthly input series are compiled 
for two-digit SIC classifications (and in some cases, three-digits) for Massa- 
chusetts, Maine and the remainder of New England. 

From annual plant, man-hour, kilowatt hour and deflated VOP data, pro- 
duction functions are constructed for each important industry or subindustry. 
The basic form of the production function takes into account the factors of 
“technological” and “scale” changes.? As a working hypothesis, if the level of 
technology is known (as indicated by a KWH/MH ratio in those industries in 
which KWH are an important input), one has an essential, identifying char- 
acteristic of a production function. Once having specified a production function, 
the scale effect (increasing or decreasing returns) can be separately determined. 

Two situations are included in our concept of “scale,” which are part of 
the working hypothesis. First, in some industries, there might be changes in 
the volume of production because of changes in the number of shifts worked, 
with no changes in the stock of capital equipment itself. On the assumption 
of constant capital stock, changes in output are a matter of the degree of 
utilization of the capital equipment and any change in labor productivity 





1 In a paper presented at the Annual Meeting of the American Statistieal Association at Atlantic City on Sep- 
tember 11, 1957, Milton Moss of the Federal Reserve Board, Research Division, presented tests of the interpolative 
assumption in those industries for which physical output and man-hours were available monthly. Estimates of pro- 
duction resulting from a smooth interpolation (within years) of annual output and man-hour relationships did not 
differ significantly from the actual physical output. However, these results pertained to aggregates of industries and 
not specific industries, and only to the interpolative procedure. 

2 Technological and scale effects have been mentioned by many writers. For example, G. W. Wilson, “An In- 
fluence of Demand Upon Productivity,” Canadian Journal of Economies and Political Science, Aug. 1956, Vol. 
XXII, No. 3; p. 379; and Witt Bowden “The Productivity of Labor in Great Britain,” Journal of Political Economy, 
June 1937, Vol. 45, No. 3, p. 347. 































































































































































































3 
ie) 
£ iSel 66! Lv6l Sel 66! Lv6i Stel 
- io 08 
a 4 -_ 4 
Z ool eo plo vars pat 001 
§ = ra s = ee _ 
5 ow +44 < oz 
7 LA 
= (2X AI) iNdinO yooisAyg AjyjuoW peyowijsy IH (II+I) anoyuow pajsodey 40d yndjnO jonuuy TT 
< 
g os os 
- 5 ~ 
2 oo [AW WY Aa a = 00 
3 3 Nat ide OF SEH 
£ 02! v 02! 
z SanoyuoW paysodey AjysuoW 7 SunoyUOW peysodey jonuUy ITT 
<a 
5 «6008 os 
Y ) ee bo + 
5 001 << = oat 001 
4 — 
= . tae 4 La . 
7 2 =e oz! 
e- $40j904 juewssnipy AjyjuoW pe,DjOdse4u} xn INdjNO jOd1SAYyg jOnuyY T rv 
IS6! ‘6Z sequieseg “‘ssoW ‘UOjsOg ‘UOI;DIDOSSY jOD1SI}DISG UONVeWY jo Bulyeow 
jonuup , SJUBWAOJOASQ sEQWNN xepul, UO UOISSS 404 ‘wejskS eAseSey jOJEPe4 OYy JO S1OUIBADD jO ps00g 
*$914S11D5S puo yos0esey 40 UOISIAIQ ‘ppjosg ““) snyysy pud poojqensy, 4 uDW07 Aq pesDdeid sedog 
% ,UOIINPOsg [OI4ISMpU; 4O KOPU; SAJESeY jO4OPE4 4O UOISIAZY Oyj UI S$8s504g, -JOUNOS 
OO! = 8601900 6b -Lh6! ‘sexepuy jiV 
X3Z0N! 3AN3S3Y WH3034 Ni NOILVOdYuSLNI 4YOJ VIVO YNOHNVW JO 3SN 
P see LUVHO 














INDEX OF MANUFACTURING PRODUCTION 339 


should properly be associated with the “volume effect.” In other industries or 
time periods, there could be increases in the stock of capital equipment, as an 
extension of the capital stock already employed (which is more consistent with 
the concept of “scale” changes). A special case of “scale” changes would be io 
decrease plant or industry capacity by failing to replace capital stock as it 
wears out. For “volume” as against “scale” changes, one would expect to find 
different relationships, at the plant level, between KWHs, man-hours and out- 
put.* If working with sufficiently disaggregated data, one would expect to find 
similar relationships at the industry level due to competitive plants imitating 
one another’s technology. In cases of “volume” (but not“scale”) changes, the 
functional relationship of KWH and man-hours ought to have a positive inter- 
cept because of the fixed elements in KWH which are not directly associated 
with production changes (e.g. air conditioning, lighting, etc.) “Scale” changes 
ought to result in nearly proportional KWH and man-hour changes over time 
and more nearly homogeneous functions. 

On the other hand, “technological change” is defined to include the intro- 
duction of a new kind of capital equipment to the production process. To 
illustrate, if a woolen and worsted broadcloth plant expanded capacity by 
buying additional mules, this would be a “scale” change. However, a shift from 
mules to more modern spinning frames would be a technological change. 
Typically, technological changes result in a much larger amount of electric 
power consumption and a new KWH and man-hour relationship. Technological 
changes, therefore, ought to be associated with shifts in the KWH/man-hour 
functions for plants and industries—but not necessarily with changes in the 
KWH/man-hour ratios, due to the non-homogeneous nature of the KWH/man- 
hour function. 

There are two basic forms of production function, either of which might be 
chosen on an empirical basis, industry by industry. 


1. Output/ MH =fKWH/MH, MBH) which is a productivity function. It is 
hypothesized that the relationship of the independent variables will 
indicate whether “volume,” “scale,” or “technological” changes are in- 
influencing labor productivity. 

2. Output =f(KWH, MH) which is the usual production function.‘ In this 
case, the KWH/man-hour ratio could be used as the criterion of “tech- 
nology” or “scale” changes and either or both the KWH and man-hour 
inputs used to estimate the “volume” effect.® 


Having selected the form of production (or productivity) functions on the 
basis of the annual plant data, the annual relationships are used together with 
monthly input indexes to make monthly estimates of production changes. In 
a given month, production is estimated from man-hours (like the FRB) but also 


* Empirically, on the basis of KWH and man-hour data for specific industries, one may be able to separate 
“volume,” “scale,” and “technological” changes, each having a different influence on labor productivity. 

4 E.g. P. H. Douglas, The Theory of Wages (New York 1934) pp. 209-11. 

5 Two sets of scatter diagrams of plant data have provided the basis for selecting the first or second type of 
function. On the first set of diagrams, output/man-hour was regressed on man-hours and then KWH/man-hours, 
On the second set of diagrams, output was regressed on man-hours and then kilowatt hours. The (multiple) correla- 
tion coefficients are the final criteria for selecting either type of function, based on linear (multiple) regression 
functions. 
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from the additional input of KWH (which of course, affects the level of pro- 
duction and labor productivity). It is expected that the addition of monthly 
KWH data will permit more accurate interpolation and especially extrapola- 
tion of output/man-hour relationships, even though the application of annual 
relationships to monthly data still entails some smoothing of the output 
estimates. 


Il, CONCEPTUAL PROBLEMS ASSOCIATED WITH THE NEW MODEL 


First, in using both man-hours and KWH in the same function, one must 
recognize a certain amount of non-comparability in the two series. There are 
some well-known limitations to using production worker man-hours to reflect 
month to month production changes. The Bureau of Labor Statistics man- 
hours are those paid for but not necessarily worked, causing difficulties in 
vacation periods and months during which major holidays occur. The man- 
hours also cover the payroll period (usually a week) in the middle of the month 
—which might not always be representative of the activity during the other 
three or three and one half weeks of the month. On the other hand, KWH data 
reflect activity for the entire month (and the actual activity, in fact, since KWH 
are not paid for unless used). Furthermore, there are probably fewer discon- 
tinuities in output/KWH than in output/man-hour relationships—for exam- 
ple, output/man-hour relationships can change during slack periods due to 
management’s desire to keep the work force intact. In most industries, month 
to month changes in activity are probably more accurately reflected by KWH 
than by man-hours. This of course, is not true in the shoe, paper box or other 
industries where power consumption per man-hour is relatively insignificant. 
Furthermore, because of the non-comparability of the two inputs, one must 
be careful in assessing short run shifts in KWH/man-hour ratios. 

The choice of data transformation and productivn function model creates a 
second kind of problem. At least three basic models could be used. The first 
might utilize a purely cross-sectional approach—-for example, regressing output 
on both inputs, with the data in raw form. This method is not satisfactory be- 
cause of wide differences in the average productivity of plants. Shifts in the 
importance of plants (some growing, some declining) produce a “shot-gun” 
effect on a scatter diagram—aside from any changes ia the plant input-output 
relationships themselves. As another objection, if large and small plants had 
different input-output relationships, a function of this type would reflect these 
plant-size differences, since the large values underlying the function would 
represent only the relationships of the large plants and the smaller values the 
relationships of the small plants. 

A second function model might be based on changes in the cross-sectional 
data, with yearly changes in the input-output data correlated. This model 
could be used when it is desirable to reduce the auto-correlation in the input 
series in order to estimate a behavioral parameter for each input. Since first 
differences of the man-hour and KWH inputs do not take account of plant size 
differences and probably aren’t comparable because of measurement difficulties, 
this approach has not been used in the present context. Furthermore, in 
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thinking about production functions in the light of the fixed factor analysis 
above, there is probably little reason to expect the KWH and man-hours to 
change inversely—excepting in periods of technological change. All other 
changes may be due to random forces or non-comparability, and short term 
changes in output are probably better measured by changes in the KWH (or 
man-hour) data alone. 

The model finally selected (since it seems to fit the data best) is based on the 
hypothesis that percentage changes in plant output are a function of percentage 
changes in man-hours and KWHs. In using the two models above, one assumes 
that absolute output changes are a function of absolute man-hour and KWH 
changes (or their differences). Actually, any of the three assumptions might be 
the most useful under given circumstances. By using index numbers to the 
base period 1950-52 in the percentage change model, the auto-correlation of 
the three variables is probably increased (with respect to the first difference 
model) but the standard error of estimate is also probably reduced, which 
should be helpful in the extrapolative procedure. The percentage change ap- 
proach was used in both the production and productivity functions already 
described. 

A final problem concerns the reversibility and durability of the annual pro- 
duction functions. In fact, is a production function reversible? For example, 
if an industry of relatively homogeneous plants were expanding its output 
(with a given level of capital stock), there would result a certain pattern of 
input productivity relationships. Now, if the industry reduces output, need the 
same production function apply? What if the more effective man-power is 
retained or waste is otherwise eliminated, or there are serious inter-plant 
productivity differences, etc? The function is reversible only if the effects of 
other important factors underlying input productivity relationships are either 
held constant or cancel out. Aggregative data are not very useful in these 
matters. What is needed are detailed plant studies dealing with the effects of 
“technological,” “scale,” and “volume” changes, as well as other important 
factors underlying productivity changes.® 

Another important aspect of the same problem is the durability of the pro- 
duction function to be used for extrapolation. As one approach, we have sug- 
gested investigations of the relationships between “technological” or “scale” 
changes and KWH/man-hour functional relationships. It is probable that 
plant and equipment expenditure surveys may reveal important changes to 
come, which can be verified and timed by KWH/man-hour changes. Inquiries 
regarding “technological” or “scale” changes can be directed to key plants. 
However, “techniological” or “scale” changes are not simple matters. All plants 
in an industry do not change their technology or scale simultaneously. The 
deepening or widening of capital (as measured by KWH/man-hour ratios) be- 
gins with one or only a few plants and there may be no noticeable effect at the 
industry level. As the new processes or increases in capacity are diffused, 





* As one example of detailed plant studies, see 8. Melman, Dynamic Factors in Industrial Productivity, John 
Wiley & Sons Inc. New York, 1956. 











342 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1958 


particularly to the larger plants, a new industry production function has 
evolved. 

The above problems are not exhaustive but merely point out some of the 
difficulties which must be dealt with in the area of aggregative industrial pro- 
duction and productivity measurement. 


Ill, SOURCES OF DATA FOR THE CURRENT MODEL 


To obtain the annual plant data for the production functions, a mail sample 
of about 1200 New England companies was asked to report the annual pro- 
duction worker man-hours, KWHs, value of production (not sales) and 
physical output wherever possible.’ It proved difficuit to deflate the value of 
production data. For the most part the commodity indexes of the Wholesale 
Price Index were used, supplemented by implicit price indexes derived from 
plant physical output and value of output data. Each plant was then coded 
by its principal product in accordance with the Standard Industrial Classi- 
fication, using the U. 8. Bureau of Labor Statistics interpretation of the code 
wherever doubt arose as to the classification of large, multi-product plants. 
From these annual plant data were derived the annual production functions, 
for the interpolative and extrapolative procedures. 

Neither of the monthly inputs, production worker man-hours and kilowatts 
hours, were originally available. The BLS New England Regional Office ad- 
vanced an intended program and began to report monthly man-hours, by two- 
digit SIC groups from January 1950. In Massachusetts greater detail was 
possible, through the Massachusetts Department of Labor and industries. 

The original obstacles were even greater in the case of monthly KWH data. 
First, it was necessary to enlist the support of 19 major electric utilities of the 
region and then to supervise the coding (by SIC) of their customers to make 
them comparable to the BLS man-hours.* Then, the reporting mechanism had 
to be activated by which the monthly data would be made available around 
the twentieth of the month. Today, all utility company reports are collected 
by the Electric Council of New England and the sample coverage ranges from 
sixty to eighty-five per cent of the two-digit SIC groups. The monthly KWH 
are corrected for working day differences, and seven standard holidays, using 
the working days published by the FRB,®* since the man-hours are not subject 
to the same calendar variations from month to month. The working day cor- 
rection is intended to increase the comparability of the monthly man-hour and 
KWH indexes. 





? Approximately 600 firms (covering 700 plants) responded to this survey with usable data. These 700 plants 
produced about one-third of the total value of manufacturing production for New England, in the base period of 
1950-52. (No BLS man-hour data were available prior to 1950, or 1947-49 would have been selected as the base 


® About 125 important self-generators of electric power also report their generation to the Federal Reserve 
Bank of Boston for each calendar month. These data are added to the reported sales of the electric utilities, in each 


* The working days for each four-digit SIC industry were published in a reprint from the Federal Reserve 
Bulletin for December 1953, issued by the Board of Governors, Federal Reserve System, Washington D.C., pp. 42- 
53. 
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IV. PRELIMINARY PRODUCTION FUNCTIONS AND INDICES 


The preliminary production functions were very crude, being based on the 
the annual aggregation of the three variables for all plants in a given industry.’ 
Aggregating variables across plants for the five individual years provides five 
observations for each variable, and relatives (1950-52= 100) were computed 
for the fifteen aggregates." In case the productivity function was used (i.e. 
DVOP/MH =f KWH/MH-+MB), the aggregated variables were transformed 
into KWH/man-hours, deflated value of production/man-hour and man-hour 
indexes. 

From the aggregated relatives, usually a linear function was computed 
for the four years 1950-53 and a prediction made for 1954, using the 1954 
input relatives and the four year function. Not much significance could be 
attached to either regression coefficients or results due to the great restrictions 
on the degrees of freedom. “Good fits” could be expected (from fitting three- 
variable regression functions to five observations), but little information on 
whether a shift in the production function had taken place. A sample illustra- 
tion of this procedure follows: 


TABLE 343 


COMPARISON OF FOUR AND FIVE YEAR FUNCTION PREDICTIONS, 
FOR THE NEW ENGLAND CHEMICAL AND 
PETROLEUM INDUSTRY 














(1950—52 = 100) 
Predictions 
Year KWH/MH MH DVOP/MH 
4 yr. Function | 5 yr. Function 

1950 97.0 94.0 97.7 99.3 97.7 

51 95.8 103.8 96.1 98.7 96.8 

52 107.0 102.2 106.5 106.7 106.8 

53 118.5 105.3 118.8 115.0 117.1 

54 132.0 101.7 128.1 124.6” 129.0 




















1950-53 (4 year) Function DVOP/MH =27.6+.717 KWH/MH +.023 MH. 

1950-54 (5 year) Function DVOP/MH =9.5+.891 KWH/MH +.019 MH. 

P—the value of 124.6 is obtained by substituting the 1954 inputs of KWH/MH (132.0) and MH (101.7) into 
the 1950—53 productivity function, as a rough test of its stability. 


As expected, in the capital intensive Chemical and Petroleum Industry, 
productivity is more closely correlated with “technical” (rather than “volume” 
or “scale”) changes, but with so few degrees of freedom, there was no adequate 
way to test the signficance of the regression coefficients. Having constructed 
the crude preliminary functions, as described above, the monthly KWH and 
MH indexes were computed relative to the base period (1950-52) and sub- 





10 However, other types of disaggregated production functi dels were tested, but due to different average 
productivities as well as different cyclical behavior of the sample plants, better results at the two-digit level were 
not forthcoming. Also, the scatter diagram technique described in footnote 5 on page 339 was used as the criterion 
for choosing either the production or productivity aggregative function for specific industries. 

1 Some 1955 data are now available and the analysis is being extended over the six year period. 
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stituted in the appropriate annual function. With reference to the FRB ap- 
proach, the output/man-hour ratio was changed throughout a year according 
to changes in the monthly KWH and/or MH levels (instead of as a smooth 
curve between annual levels of output and man-hours). Secondly, in extrapolat- 
ing output/man-hour levels one had the benefit of the KWHs (in addition to 
the MHs) in those years for which annual output data were not yet available. 
As long as the industry production functions weren’t discontinuous, the KWH 
data were valuable additions for interpolative and extrapolative purposes. 

Net Value Added by Manufacture weights (the average for 1950-52) for 
each two-digit SIC Industry, were used in constructing the total Monthly 
Index of Manufacturing Production. The weights were derived from the annual 
Surveys of Manufactures of the Bureau of the Census. 

In Chart 345, the preliminary total New England Index of Manufacturing 
Production is compared with that the of FRB for the period of 1950-57. The 
general movements of both seasonally adjusted series are similar, with slightly 
more growth in the national series but a general coincidence of turning points. 
Annual averages are used from 1950 through 1954 and monthly data since that 
time, accounting for the greater variability in the series from 1955. 

One check on the accuracy of the new index was provided by the net value 
added estimates of the Census Bureau (which were undeflated). For compari- 
son purposes, total indexes of man-hours and kilowatt hours are included in 
Table 344 below. All data are for New England: 


TABLE 344 


COMPARISON OF THE TOTAL INDEX OF INDUSTRIAL PRODUCTION, 
NET VALUE ADDED, PRODUCTION WORKER MAN-HOURS AND 
KWH CONSUMPTION FOR NEW ENGLAND 
MANUFACTURING INDUSTRIES 














(1950-52 = 100) 
a Index of Index of Total Index of Total 
Y Manuf oe a Undeflated Production Industrial 
=—_ —— acturing Net Value Worker KWH Sales & 
Production Added* Man-Hours Generation 
1950 91.8 89.5 96.6 96.1 
51 102.6 102.1 102.8 101.7 
52 107 .6 108.4 100.6 102.2 
53 117.5 119.3 102.5 112.0 
54 105.6 109.5 89.4 108.1 
55 113.9 n.a.t 92.2 119.7 

















* The estimates of total net value added by manufacture for New England are taken from the annual Surveys 
of Manufactures, Bureau of the Census, U.S. Department of Commerce. 
t Not yet available. 


Since the Census Net Value Added estimates were uncorrected for price 
changes, they could be expected to show an upward drift over the Index of 
Manufacturing Production. Also, the differences between the estimates of net 
value added and output for individual industries would tend to be larger. Yet, 
there is a general similarity of trends and cyclical turning points in the total 
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net value added and output series compared above. The different trends of 
man-hours and KWH are also noticeable. Table 344 is presented as a sample 
of the external tests applied to our results and to illustrate the necessity of not 
relying on man-hours alone in making productivity estimates. 


Current Production Function Research 


Current research on production functions is aimed at a better utilization of 
the micro data available. One of the reasons for the crude form of the original 
function was the fact that monthly input data (man-hours and KWH) were 
available only in 2-digit Standard Industrisi Classification groups. Now, 
greater monthly input detail is becoming possi!sle and functions are being con- 
structed at the 3-digit level. To obtain the outpvt estimate for a 2-digit SIC 
industry, the outputs of the 3-digit groups are summed, instead of the input- 
output relations. 

Several interesting relationships are also being investigated. For example, 
over the six year period (1950-55), in many industries, the percentage change 
of output was a linear (and sometimes homogeneous) function of the percentage 
change of the man-hour inputs. Discontinuities and non-linearities were not 
frequently encountered. Usually, KWH percentage changes were a linear, (but 
not homogeneous) function of man-hour changes, with no evidence of shifts 
in the function over the six year period. The KWH/Man-hour relationships 
are being tested as indicators of “volume,” “scale” or “technological” changes. 
In the analysis, the input and output data are supplemented by data on piant 


and equipment (book value), number of shifts worked, and wages paid to pro- 
duction workers. 

The above notes indicate only the direction of the research—no firm conclu- 
sions have been drawn as yet. 


Concluding Remarks 


The methods used in the new production indices ought to provide some 
benefits to users. Certainly more accurate production estimates can be made 
from the inputs of both KWH and man-hours than from man-hours alone. 
With reference to the Federal Reserve Board methods, these methods offer 
some specific improvements. First, the functional approach permits produc- 
tivity estimates to be based on “scale” or “volume” changes (as against straight 
interpolation or extrapolation of output/man-hour relationships). Second, in 
cases of “technological” changes, which also change the production functions, 
the shift of the KWH/M-H ratios may very well signal the “technological” 
shift, and probably help one in locating the new production function. The first 
improvement should be valuable in interpolative and the latter in extrapolative 
procedures. , 

Also, due to the simplicity of the index model, involving only two compo- 
nents centrally collected, publication dates tend to be earlier than for more 
complicated indices. Both speed and coverage will be increased when all 
electric utilities, regardless of size, begin to use high-speed equipment for 
billings. 
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Another advantage of the new indices is that measurement is provided in 
areas where no measurement of either production or productivity currently 
exists. Also, the basic data used in constructing the production indices are 
valuable for studies in multi-input productivity or in forecasting future man 
and electric power requirements. 

As is true with any index of production, there are certain limitations to the 
new index. Plants returning the necessary data by mail may make reporting 
errors (although these have been rectified by a follow-up phone call when 
suspected). Also, the sample of plants providing the basis of the annual func- 
tions may not be representative, either because of the exclusion of key plants 
which refuse to cooperate or because of the inclusion of plants which happen 
to be not typical of the industry. Attempts are made to guard against such 
possibilities, but these may not always be successful. 

Since the new model employs the concept of deflated value of production, 
price deflation problems are considerable. It is not aways possible to find the 
exact commodity price to apply to the physical output or value of product data. 
In this event, subjective estimates and compromises have to be relied upon. 
It is hoped that such pricing errors are distributed randomly, so that their effect 
will be minor following the grouping of plants for the annual functions. 

It should be noted also, that there are no benchmark data which can serve 
as absolute checks on the levels of the various indices. The lack of benchmark 
data extends to both monthly and annual series. 

A key limitation, however, is that the productivity or production estimates 
must be based on annual relationships which are two or three years old. This 
limitation is one reason we are investigating the relationships of plant and 
equipment spending, KWH/Man-hour and Output/Man-hour functions. Of 
course, if input/output data could be collected more often, these extrapolative 
errors could be reduced. 

In general, it is no doubt true that the above described indices cannot be 
as accurate as monthly physical output indexes. On the other hand, the inclu- 
sion of the KWH input ought to make possible more accurate interpolations 
and extrapolations of output/labor relations, either within or beyond estab- 
lished annual historical levels. Finally, the basic technique applied to various 
sectors of the economy, can be useful in areas where only employment data 
are currently available. Obviously, additional and different inputs might be 
required in specific situations, to be combined in a way not yet conceived. 
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Expenditure-income relations of selected groups of families are ex- 
amined in two ways: (a) with the importance of transitory income 
increased and (b) with its importance decreased. The importance of the 
transitory income is increased by dividing groups of families into sub- 
groups differing in permanent income, and then examining consumption 
in relation to current income of the subgroups. Such stratification tends 
to increase the impo’ tance of transitory income, since the more homoge- 
neous the families are in permanent income the more the difference in 
income observed is transitory. The income elasticity of total expendi- 
tures is generally lower for the subgroups than for the entire groups— 
by 30 to 40 per cent. Similar tendencies also occur for expenditures for 
food, clothing and other consumer goods. 

The effect of the contrary tendency, i.e., reduced importance of 
transitory income, is tested by using mean income of consecutive years 
as a measure of economic status in place of current annual income. 
When this is done the coefficient of elasticity of expenditures in relation 
to income among families is increased over that observed for expendi- 
tures in relation to annual income. The increase is around 30 per cent. 

The findings support the hypothesis that much of the difference ob- 
served among groups in expenditure-income relations is the result of 
variation in the mixture of the income components. 


AMILY income in any one year is looked upon as having two components, 

described here as permanent and transitory. The permanent component 
can suitably be looked upon as the systematic component that families count 
upon to finance their consumption level.’ It seems highly probable that ex- 
penditures in any one year are more closely related to the permanent than to 
the transitory component. If so the more important the transitory income the 
less will be the relationship between expenditures and income, and the lower 
will tend to be the elasticity of expenditures in relation to income. 

This paper presents findings on the relation of annual expenditures of two 
groups of households to income, under conditions that vary the importance of 
transitory income, the variation being achieved in the manner described above. 
The findings support the assumption concerning the relation of income com- 
ponents to consumption. 





* This article presents a refinement and elaboration of the analysis by Marilyn Dunsing, “Income-Expenditure 
Relations of Farm Families Using Different Stratification,” unpublished doctoral dissertation, Library University 
of Chicago, Aug. 1954. Assistance from the California Agricultural Station made possible the refinement of the 
analysis. 

1 Milton Friedman, A Theory of the C ption Function, National Bureau of Eeonomic Research, Princeton 
University Press, Princeton, 1957, presents a general interpretation of these concepts. 
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1, PREVIOUS ANALYSES OF ENTIRE GROUPS AND RESPECTIVE 
SUBGROUPS WITH COMMON INCOME EXPERIENCE 


W. W. Cochrane? appears to have been the first investigator to have explored 
the effect of expenditure-income relations of stratification into subgroups with 
a common income experience. He compared, however, the income elasticity of 
expenditures of an entire group in 1940 with that of its sub-groups in 1942 that 
had had a common income in 1940; in other words, his comparison was between 
the entire group in one year and sub-groups in a later year. Cochrane referred 
to the expenditure-income relation of the entire group in 1940 as static and 
that of the sub-groups in 1942 as dynamic.The income elasticities of expendi- 
tures of the sub-groups in 1942 were below the elasticity of the entire group in 
1940. Cochrane interpreted his findings as follows: 

...Itis... asserted, given a change in income, that families do not immeditely 
reorganize their budget to conform to the norm of the new income position; hence, 
expenditures ...in a dynamic situation are likely to be smaller, income class by 
income class, as we ascend the income scale, than in a static situation. These con- 
tinuous data present tangible evidence of this tendency.* 

. .. And when we find, as we have in these data, that the elasticities of expenditure 

.. are extremely inelastic—that is, unresponsive to changes in income—then we 
must conclude that expenditures for family living are practically constant in the 
short run... .¢ 


If the hypothesis to be tested in this paper is valid, what Cochrane observed 
was not static expenditure-income relations in 1940, in the sense of a long-run 
adjustment, versus lack of adjustment of expenditures to change in permanent 
income in 1942. Rather, he observed the effect of some transitory component 
in the 1940 incomes of the entire group versus the effect of a larger transitory 
component in the 1942 incomes of the subgroups. 


2. DATA USED AND THE EARLIER ANALYSIS 


The data used come from records of incomes and expenditures kept by farm 
families during three consecutive years. They are for the years 1940, 1941, and 
1942, and were assembled by Cochrane from state Agricultural Experiment 
Stations and from the former Farm Security Administration (now the Farmers 
Home Administration). In this paper families reporting to these two agencies 
will be referred to as Type I and Type II families, respectively.’ Their character 





2 W. W. Cochrane, “Farm family budgets—a moving picture,” The Review of Economics and Statistics, XXIX 
(Aug. 1947), 189-98. 

3 Ibid., p. 195. 

4 Ibid., p. 197. 

5 Income and expenditure items in the account forms are not entirely comparable for the two groups of families. 
Since the present investigation relates to sub-groups versus the entire group of which they are a part, the effect of 
different definitions is not especially relevant. The difference does, however, reduce the usefulness of comparison 
between Type I and Type II families. It is of interest to note that the purchase of consumer durables tended to be 
classed as savings by Type II families and as consumption expenditures by Type I families. 

Some differences between Type I and Type II families may influence the expenditure-income relations. Type I 
families had incomes above the average of their areas. They included both owner and tenant operators, and there 
was some tendency for an over-representation of older, well-established operators. Their accounts were, however, 
wholly voluntary, being submitted with the thought that efficiency of farm operation should be increased by 
the advice received from farm management specialists who had examined the accounts. On the other hand, 
Type II families had i about average of their areas. All were relatively young owner-operators who had only 
recently acquired ownership of their farms by means of a Federal loan. They were required to prepare a budget 
or spending plan covering both the family and the farm, and accounts were in some measure under scrutiny. These 

1 cir t of Type II families make us suspect that their expenditure-income relations probably had 
less in common with those of farm families in general than do the expenditure-income relations of Type I families 
Consequently, in this paper the data of Type I families are examined more intensively than those of Type II families. 
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istics are described in some detail in The Changing Composition of Family 
Budgets for Selected Groups of Corn Belt Farmers, 1940-42 by Cochrane and 
Grigg.’ This analysis uses the following data: those for 236 Type I families in 
Illinois and Minnesota; and those for 587 Type II families in Illinois, Iowa, and 
Minnesota. All these families reported a positive income in each of the three 
years.’ 

Several analyses of these data have already been made. One by Cochrane 
has been commented on in Section 1 above.* An analysis by Reid showed that 
year-to-year variability of farm operating expenses tends to affect tle expendi- 
ture-income relationship observed, and that these account-keeping farm fami- 
lies in this respect are very similar to other farm families. Farm operating ex- 
penses may cause net income to be atypically low for some families and atypi- 
cally high for others. A low income caused by relatively high farm operating 
expenses tends to lower the income elasticity and a high income caused by rela- 
tively low farm operating expenses tends to increase it.° 

Reid has also examined the correlation of incomes of various groups of 
families in consecutive years and found marked ‘year-to-year variability in 
relative income positions. Furthermore, the lower the correlation of incomes in 
consecutive years, the lower tended to be the income elasticity of expenditures. 
It seemed reasonable to assume that the lower the correlation of incomes, the 
greater the importance of the transitory components of income, and the greater 
the importance of transitory incomes, the lower the income elasticity of ex- 
penditures.'° 


In an earlier article analyzing these data Reid and Dunsing concluded that 
variation in level of expenditures among groups of families differentiated by 
degree of income change is probably caused by the presence of transitory in- 
come, for some groups positive and for others negative, rather than by lag in 
expenditures behind change in permanent income." This paper extends the 
analysis of these data by testing the effect on income elasticities of varying 
mixtures of the permanent and transitory income components. 


3. ESTIMATES MADE FOR ENTIRE GROUPS AND RESPECTIVE SUB-GROUPS 


For Type I families income elasticities of expenditures for 1941 and 1942 are 
estimated for both entire groups and for their respective sub-groups. The sub- 





* Bureau of Agricultural Economics, United States Department of Agriculture, 1946. This report and The Re~ 
view of Ei ics and Statistics, XX TX, Aug., 1947, loc. cit. describe the income and expenditure levels of the families 
included in this analysis. Most of the estimates presented in this paper relate to ungrouped data. 

1 The original set consisted of 367 Type I families and 642 Type II families. Sixteen families were excluded be- 
cause & negative income was reported for at least one of the three years. The main exciusion was Type I families 
reporting from Iowa. They were excluded because of extremely high year-to-year variability of incomes. The 
variability was so high that incomes of any one year seemed likely to have little or no correlation with permanent 

Such correlation was a necessary condition in order to ensure subgroups with a transitory income of 
greater importance than that of the entire group. Type II families of Wisconsin were excluded because of the small 
number of families. 

* The tabulation and analysis of the data by Cochrane go far beyond the comparison of income elasticities of 
expenditures of entire groups and respective sub-groups described in Section 2. 

* “Effect of Income Concept upon Expenditure Curves of Farm Families,” Conference of Research in Income and 
Wealth, National Bureau of Economic Research (Cambridge Press, 1952), XV, pp. 131-74. 

© For a preliminary report see Margaret G. Reid, “Savings by Family Units in Consecutive Periods,” W. W. 
Heller, ed.), Savings in the Modern Economy, Symposium, Minneapolis, 1953, pp. 218-20. Further details are 
reported by Friedman, op. cit., p. 185, 191-3. 

ul “Effect of Variability of Incomes on Level of Iacome-Expenditures Curves of Farm Families,” Review of 
Economics and Statistics, XX XVIII, Feb. 1956, pp. 90—5. 
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groups for 1941 are those with 1940 incomes falling within specified intervals. 
(See Table 352, Panel A.) For 1942 there are two sets of sub-groups: (a) those 
with common income interval in 1940 and (b) those with common income 
interval in 1941. (See Table 352, Panel B.) For each of the three sets there are 
at least seven sub-groups, most of which have an income range of $500 in the 
common income interval year; consequently, the number of families in any one 
sub-group is small.” 

For Type II families income elasticities of expenditures for 1942 only are 
estimated. There are two sets of sub-groups for 1942: (a) those with common 
income interval in 1940 and (b) those with common income interval in 1941. 
The intervals have an income range of $1,000. With such income intervals there 
are only three sub-groups in each set.* (See Tabie 353.) 


4. FORM OF THE EXPENDITURE-INCOME RELATION 


Preliminary examination of the data indicated that on logarithmic scales 
total expenditures in relation to income are fairly linear. The equation is 
log Y=log a+b log X where X represents annual net money income and Y 
represents annual cash expenditures for (a) all consumer goods, (b) food, 
(c) clothing, (d) household, (e) medical care, and (f) other items. For Type I 
families all coefficients are estimated using ungrouped data and for Type II 
families they are estimated using grouped data.“ 


5. INCOME ELASTICITIES OF TOTAL EXPENDITURES OF 
ENTIRE GROUPS AND RESPECTIVE SUB-GROUPS 


Type I Families:* For Type I families the income elasticities of total ex- 
penditures of entire groups was uniformly higher than that of the subgroups. 
They were as follows: 








Entire group Sub-groups 





Year of expenditure- 
income relation Income elasticity 
of expenditures 


Mean income Year of 
elasticity of common-income 
expenditure interval 





1941 .394 . 242 1940 
1942 -472 .337 1940 
1942 -472 326 1941 











In addition, the income elasticities of total expenditures for 1941 and 1942 for 
the majority of the sub-groups of each set are lower than for the entire group." 





12 For each set the selected sub-groups include over 80 per cent of the 236 families. 

3 The selected sub-groups with common income interval in 1940 include 94 per cent of the 587 families and 
those with common income interval in 1941, 89 per cent. 

“4 Data are grouped by $1,000 class intervals. 

% All findings presented for Type I families are either taken directly from, or derived from, Table 352. 

% For 1941 there are seven svb-groups; in six of these the income elasticity of total expenditures is less than that 
of the entire group. For the two sets of sub-groups for 1942 the results are similar. Thus the relationship between 
the elasticities of the sub-groups and that of the entire group is consistent even though the number of families in the 
various sub-groups is small. Such exceptions as occur appear to be random. 
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TABLE 352 


AVERAGE INCOME ELASTICITY OF EXPENDITURES* OF VARIOUS 
CONSUMER GOODS, 1941 AND 1942, OF TYPE I FARM FAMILIES, 
ENTIRE GROUPS AND RESPECTIVE SUB-GROUPS 








italian Category of consumer goods 


Groups and sub-groups of 
farnilies Total Food Clothing 





House- Medical 


a «=. 





Panel A. Type I Farm Families, 1941 





Entire group in 1941 3 . 394 .185 





1. Sub-groups with 1940 income” 
. $1,000 to under $1,500 . .098 
. $1,500 to under $2,000 a .080 
. $2,000 to under $2,500 
. $2,500 to under $3,000 é .348 
». $3,000 to under $3,500 
. $3,500 to under $4,000 4 — .061 
. $4,000 to under $5 000 ; .369 











Weighted mean of sub-groups F .159 





Panel B. 





Entire group ir 1942 236 





1. Sub-groups with 1940 income of 
. $1,000 to under $1,500 
. $1,500 to under $2,000 
. $2,000 to under $2,500 a .037 
. $2,500 to under $3,000 . .242 
. $3,000 to under $3,500 
. $3,500 to under $4,000 d . 206 
. $4,000 to under $5,000 R .100 





Weighted mean of sub-groups .136 





2. Sub-groups with 1941 income of 
. $1,000 to under $2,000 ‘ .169 

$2,000 to under $2,500 21 , .039 -086 

. $2,500 to under $3,000 31 é -105 .449 

$3,000 to under $3,500 28 4 166 

$3,500 to under $4,900 19 ‘ J -036 

$4,000 to under $4,500 21 d .025 .438 

$4,500 to under $5,000 22 J 7 .408 J - 503 
.. $5,000 to under $6,000 33 F .353 .535 .058 -185 


PRoo po oF 











Weighted Mean of sub-groups 190 .326 .130 .350 .163 .377 





* All estimates of income elasticities relate to money expenditures in relation to money income. 

> The income used in classification includes the value of home production and of housing received without direct 
expense; this is the same procedure as that used by Cochrane. The same relationship would, however, be obtained 
between entire groups and sub-groups if families had been divided on the basis of net money income. 

Source: See text, Section 3. 





Regressions of expenditures on income for the entire group and a specificd 
subgroup for 1941 and 1942 are illustrated in Figure 355. For 1941 curve AB 
representing the entire group has an elasticity of .394, and curve CD represent- 
ing the sub-group with common income interval of $1,500 to under $2,000 in 
1940 has an elasticity of .223. For 1942 curve EF representing the entire group 





EFFECT OF TRANSITORY INCOME ON INCOME ELASTICITY 353 


TABLE 353 


AVERAGE INCOME ELASTICITY OF EXPENDITURES* OF VARIOUS 
CONSUMER GOODS, 1942, OF TYPE II FARM FAMILIES, ENTIRE 
GROUP AND SUB-GROUPS 








Category of consumer goods 
Number ! 
Groups and sub-groups of a ea 


families | Total Food ine hold Other 








Entire group in 1942 587 .383 6 .252 .306 .363 @#8.627 





1. Sub-groups with 1940 income? of 
a. $1,000 to under $2,000 3385 .124 .267 8.304 
b. $2,000 to under $3,000 . 278 .200 = .22: .291 
c. $3,000 to under $4,000 207... SOB - 0. ES 





Weighted mean of sub-groups ‘ 305 .166 3 .2: . 283 





2. Sub-groups with 1941 income of 
a. $1,000 to under $2,000 .332 .140.. .427 
b. $2,000 to under $3,000 -263 .170_ .18: .153 
c. $3,000 to under $4,000 37 .. 211 , . 264 





Weighted mean of sub-groups ; .. — . 246 











* See Table 352 for an explanation. 
> See Table 352 for an explanation 
Source: See text, Section 3 


has an elasticity of .472 and curve GH representing the sub-group with com- 
mon income interval of $3,000 to under $3,500 in 1940 has an elasticity of .333 
(Panel B).?” 

Subsidiary findings of special interest which relate to total expenditures in- 
clude: (1) The income elasticity of total expenditures is lower for the entire 
group in 1941 than in 1942; the same is true of the mean elasticities of the sub- 
groups. (2) The mean elasticities of the two sets of sub-groups in 1942 are 
essentially the same, namely, .337 for the sub-groups with common income 
interval in 1940 and .326 for those with common income interval in 1941. 

(3) The mean elasticity of the sub-groups appears to be a function of the 
absolute level of the elasticity of the entire group. It seems reasonable to assume 
that the greater the importance of the transitory component in the incomes of 
the entire group, the greater its importance in the incomes of the respective 
sub-groups. The difference between the mean income elasticities of the sub- 
groups and the elasticity of the entire group is essentially the same for the 
three sets; the method of classification appears to have added about the same 
amount of transitory component to the incomes. The differences are as follows: 





17 The sub-groups selected have the following characteristics: (1) They have income elasticities which are repre- 
sentative of the mean elasticities of the sub-groups of their respective sets. (2) They illustrate one and two year 
intervals between the common income interval year and the year of income elasticity. (3) They illustrate positions 
nearer the lower and upper end of the income size distribution for their respective entire group of families in the 
common income interval year. 
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Difference between the mean 
elasticity of sub-groups and the 
elasticity of the entire group 





Sub-groups of 1941 — .152 
Sub-groups of 1942 with common income interval 
a. in 1940 — .135 
b. in 1941 — .146 








(4) The ratio of the mean income elasticities of the sub-groups to the elastic- 
ity of the entire group is somewhat lower for the 1941 set than for the two 
1942 sets. The ratios are as follows: 








Ratio of the mean elasticity of 
sub-groups to the elasticity 
of the entire group 





Sub-groups of 1941 .614 
Sub-groups of 1942 with common income interval 
a. in 1940 .714 
b. in 1941 691 








Type II Families:** Tendencies for Type II families were similar to those for 


Type I families. The income elasticity of expenditures of the entire group of 
Type II families for 1942 was higher than the mean elasticity for the subgroups. 
It was .383 for the entire group and .305 for the sub-groups with common in- 
come interval in 1940 and .299 for the sub-groups with common income interval 
in 1941. In addition the income elasticity was higher for the entire group than 
for any of the sub-groups. 

Subsidiary findings for Type IT families relating to total expenditures include: 
(1) the mean elasticities of the two 1942 sets of subgroups are very similar, i.e., 
those with common income intervals in 1940 and those with common income 
intervals in 1941. 

(2) The difference between the mean income elasticities of the sub-groups 
and the elasticity of the entire group is —.078 for the sub-groups with common 
income interval in 1940, and —.084 for those with common income interval in 
1941. (3) The ratio of the mean income elasticities of the sub-groups to the 
elasticity of the entire group is .796 for the sub-groups with common income 
interval in 1940, and .781 for those with common income interval in 1941. 
These findings of smaller differences and higher ratios for Type II than for 
Type I families for 1942 are to be expected since the intervals for Type II 
families have a larger income range than those for Type I families.'* 





18 All findings presented for Type II families are either taken directly from, or derived from, Table 353. 

19 The larger the income range in the common income interval year, the less the tendency for the families of 
each of the sub-groups to be alike with respect to the permanent components of income. Hence, in the later year, 
the less the dispersion of i consists of transitory components the less will be the difference in the relative 
importance of the transitory components of income between the sub-groups and the entire group. 
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6. INCOME ELASTICITIES OF EXPENDITURES OF VARIOUS 
CATEGORIES OF CONSUMER GOODS 


Type I Families:*° For five categories of consumer goods the mean income 
elasticities of expenditures of the sub-groups of Type I families for 1941 and 


3.6 





PANEL A--1941 


AB--ENTIRE GROUP y=1.6 +.394x 
CD--SUB-GROUP OF 1940 

INCOME INTERVAL 

$1500 TO UNDER $2000 y=2.3 +.223x 


FAMILY EXPENDITURES (LOGARITHMS) 








1 l L l 
3.0 3.2 3.4 3.6 
NET MONEY INCOME (LOGARITHMS) 








PANEL. B--1942 


EF--ENTIRE GROUP y=15 + .472x 


GH--SUB-GROUP OF 1940 
INCOME INTERVAL 
$3000 TO UNDER $3500 y=2,1 + .333x 


(SWH1IYY5O1) SIYNLIGNSdX3 ATIWVI 








i ! ! ! ! ! 
3.2 3.4 3.6 3.8 4.0 4.2 4.4 


NET MONEY INCOME (LOGARITHMS) 


Fie. 355. Family Expenditures in relation to Net Money Income, Type I 
families. Entire group and a selected subgroup, 1941 and 1942. 


20 All findings presented for Type I families are either taken directly from, or derived from, Table 352. 
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1942 are uniformly lower than the elasticity of the entire group. For 1941 they 
are as follows: 
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Mean elasticity of Elasticity of entire 
Type of consumer good sub-groups of 1941 group of 1941 
Food .159 .185 
Clothing . 253 .375 
Household . 266 .387 
Medical care .364 .470 
Other .243 .442 











For 1942 the mean income el.sticities of the two sets of sub-groups and that 
of the entire group are as follows: 














Mean elasticity of sub-groups of 1942 
with common income interval year of | Elasticity of entire 
Type of consumer good group of 1942 
1940 1941 

Food .136 .130 .205 
Clothing .332 .350 .498 
Household .113 .163 . 290 
Medical care .385 .377 .604 
Other .474 .495 .636 











In addition, the income elasticities of expenditures of the five categories of 
consumer goods in 1941 and 1942 are lower for most of the sub-groups of each 
set than for the entire group.” 

The difference between the mean income elasticities of expenditures of the 
sub-groups and the elasticity of the entire group reflects the effect of the 
greater importance of the transitory component of income in the sub-groups. 
This difference is less for food than for the other categories of consumer goods. 
The differences are as follows: 








Difference between the mean elasticity of sub-groups 











§ and the elasticity of the entire group 
¥ Type of consumer good Sub-groups of 1942 with common 
Sub-groups of 1941 income interval year of 
1940 1941 
Food — .026 — .069 — .075 
Clothing — .122 — .166 —.148 
Household —.121 —.177 —.127 
Medical care — .106 — .219 — .227 
Other —.199 — .162 —.141 











% For 1941 there are seven sub-groups; for five of the sub-groups for food, four for clothing, six for household, 
four for medical care, and five for other the income elasticities of expenditures of the various categories of consumer 
goods are less than the elasticity of the entire group. For the two sets of sub-groups for 1942 the results are similar. 
As in the case of total expenditures the exceptions to the tendency of the income elasticities of sub-groups to be less 
than that of the entire group, for the various categories of consumer goods, appear to be random. 
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When the difference between the mean income elasticities of the sub-groups 
and the elasticity of the entire group is expressed as a ratio, the additional 
transitory component of income of the three sets of sub-groups appears to 
affect the income elasticities of the five categories of consumer goods in much 
the same way. The ratios are as follows: 








Ratio of the mean elasticity of sub-groups to the 
elasticity of the entire group 











Type of consumer good Sub-groups of 1942 with common 
income interval year of 
Sub-groups of 1941 © 

1940 1941 
Food .859 .663 -634 
Clothing .675 .667 .703 
Household .687 .390 .562 
Medical care .774 .637 .624 
Other .550 .745 .778 











While it seems likely that the purchase of certain consumer goods such ag 
durables is influenced more by transitory income than the purchase of other 
goods such as food, the subsidiary findings provide no basis for such differenti- 
ation among the categories. Market controls by 1942 may have had a differen- 
tial effect on the various categories of consumer goods; there seems to be no 
way of isolating this effect. 

Type II Families:* For four categories of consumer goods the mean income 
elasticities of expenditures of the sub-groups of Type II families for 1942 are 
uniformly lower than the elasticity of the entire group. For the two sets they 
are as follows: 








Mean elasticity of sub-groups of 1942 
with common income interval year of | Elasticity of entire 


Type of consumer good group of 1942 








1940 1941 
Food . 166 .174 .252 
Clothing . 234 .210 .306 
Household . 283 . 246 .363 
Other .520 .548 .627 











The difference between the mean income elasticities of expenditures of the 
sub-groups and the elasticity of the entire group is, in general, the same for the 
various categories of consumer goods. The differences are as follows: 





2 All findings presented for Type II families are either taken directly from, or derived from, Table 353. 

2 In addition, the income elasticities of expenditures of the four categories of consumer goods in 1942 are lower 
for all of the sub-groups with common income interval in 1940 and for two of the three sub-groups with common in- 
come interval in 1941. 
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Difference between the mean elasticity of sub-groups 
and the elasticity of the entire group, for sub-groups 
Type of consumer good with common income interval year of 
1940 1941 

Food — .086 — .078 
Clothing — .072 — .096 
Household — .080 —.117 
Other — .107 — .079 








As with Type I families, the ratio of the mean income elasticities of expendi- 
tures of the sub-groups to the elasticity of the entire group for Type II families 
indicates that the additional transitory component of income of the sub-groups 
affects the income elasticities of the four categories of consumer goods in much 
the same way. The ratios are as follows: 














Ratio of the mean elasticity of sub-groups to the 

elasticity of the entire group, for sub-groups with 
Type of consumer good common income interval year of 
1940 1941 
Food .659 .690 
Clothing . 765 .686 
Household .730 .678 
Other .829 .874 








7. ELASTICITY OF EXPENDITURES IN RELATION TO CURRENT 
INCOME AND TO INCOME OF THREE YEARS 


If income of three years is a better index than current income of the perma- 
nent income of families, and if expenditures are more closely related to perma- 
nent than to current income, then one would expect the elasticity of ex- 
penditures in any year to be higher in relation to income of three years than to 
current income. Such a difference is observed for Type I farm families for all 
categories of consumption. The coefficients of elasticity of expenditures in 
1942 in relation to income are as follows: 











Expenditures Current Income Income of Three Years 
Total 394 .618 § 

Food .185 . 253 

Clothing .375 .588 

Household .387 .463 

Medical Care .470 .767 

Other .442 .818 











A similar estimate for total expenditures in 1942 was made for Type II 
families. It shows the following coefficients of elasticity of 1942 expenditures in 
relation to income: income of 1942, .383 and three-year income, .508. 

Thus for Type I families income elasticity of total expenditures in relation to 
three-year income was higher by 31 per cent than that in relation to current 
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income, and of Type II families 32 per cent. For Type I families the tendency 
for each consumption category was similar to that of total expenditures. 

This comparison demonstrates that elasticity of expenditures in relation to 
annual income, at least for farm families, is likely to understate the difference 
in expenditures among families that occurs because of difference in level of 
income. It does not, however, provide an estimate of the elasticity of expendi- 
tures in relation to permanent income, since it seems highly probable that 
transitory income influenced the variability of the three-year incomes. 

Furthermore for Type I families the elasticity of expenditures for consumption 
in 1942 in relation to mean income of three years is about 84 per cent higher 
than the mean elasticity for the sub-groups stratified by the income of an earlier 
year, and for Type IT families about 70 per cent higher. 


8. SUMMARY AND CONCLUDING REMARKS 

The general hypothesis of this paper is that the more the difference in in- 
comes of families consists of transitory components of income, the lower will 
tend to be the elasticity of expenditures in relation to that income. The hy- 
pothesis was tested by dividing a group of families into sub-groups that have 
& common income experience in an earlier year. This method of classifying 
families causes the transitory components of income in a later year to be more 
important for sub-groups than for the entire group. It was further tested by 
estimating expenditures in relation to three-year incomes. 

Income elasticities of expenditures observed support the hypothesis. The 
income elasticity of total expenditures for entire groups is uniformly higher 
than the mean elasticities of the respective sub-groups. In addition, for various 
categories of consumer goods the income elasticity of expenditures for entire 
groups is also uniformly higher than the mean elasticities for the respective 
sub-groups. Furthermore income elasticities of expenditures in relation to 
three-year incomes was higher than those in relation to one-year incomes. 

Higher elasticities for entire groups than for respective sub-groups indicate 
that the more alike families are in the permanent components of their income, 
the more the difference in income reported in any one year is likely to be caused 
by transitory income. Thus the income elasticity of expenditures of a single 
occupational group with a relatively narrow range of permanent incomes is 
likely to be lower than the income elasticity of expenditures of all occupational 
groups combined unless the group with the lower dispersion of permanent in- 
comes also has a lower dispersion of transitery incomes. 

Probably the most important finding of this analysis is the further evidence 
it provides that much of the variation in the income elasticity of expenditures 
observed among groups of families is a function of the mixture of permanent 
and transitory income components. Because of such mixture, the elasticities of 
expenditures estimated from the data give a biased estimate of the relation of 
expenditures to the permanent component of income and of the effect on ex- 
penditures of a recent change in the permanent component of income. While 
the elasticities presented in this paper are estimated from data of farm families, 
similar results would be expected for other groups of families whose incomes 
also contain considerable transitory components. It seems highly probable that 
all of the income elasticities of expenditures reported in this analysis understate 
the income elasticity of expenditures in relation to the permanent component 
of inenme. 











THE FIRST 1,945 BRITISH STEAMSHIPS 
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This paper presents data showing the main characteristics of all the 
steamships built in the United Kingdom and placed under permanent 
British registry from 1814 through 1860. The historical setting in which 
the British steam merchant fleet developed is discussed and the data 
are interpreted in relation to that environment. It is argued on the 
basis of these data that many of the prevailing notions about steam 
shipping before 1860 need to be revised. 


1, INTRODUCTION 


HIS paper presents data, which have not been utilized before, showing the 
yee characteristics of all the steamships built in the United Kingdom 
and placed under permanent British registry! from 1814 through 1860. These 
data should provide scholars the means to gain a clearer perspective of the 
history of steam navigation prior to the outbreak of the American Civil War. 

The only official data on British steamship construction which have been 
generally available to scholars heretofore have been the “Number and Ton- 
nage” figures given in the Statistical Abstract for the United Kingdom from 1840 
onwards. These Statistical Abstract data are aggregates enumerating all steam- 
ships built and first registered in the U.K. and contain no information regarding 
how much of annual construction actually passed into permanent U.K 
registry. Nor do these series contain any information regarding the number 
of wood and iron steamers built, or the types, whether paddle or screw pro- 
pelled. The new series presented in this paper fill all of these gaps.” In addition, 
measures have been constructed which show carrying capacity of new con- 
struction entered into permanent U.K. registry. These measures take into 
account advances in the technology of shipbuilding. 

The present paper presents information which indicates that many of the 
prevailing notions about steam shipping before 1860 need to be rather sharply 
revised. It is shown, for example, chat the general adoption of iron in steam- 
ship construction dates from the 1840’s, more than a full decade earlier than 
the accepted estimates; that by the end of the 1850’s most of the British steam 
merchant fleet was iron rather than wood; that the iron-screw steamer was 





1 The data presented below are not the same as annual data given in the Statistical Abstract for the United King- 
dom for ships “built and first registered in the United Kingdom.” Such data include ships which, although first built 
in the U.K., later passed out of British registry by reason of sales to foreigners. As is indicated below, the ships 
included in the data presented in this paper were entered permanently into U.K. registry during the period covered 
by these data. Hence in some years sales of British-built st to foreigners is reflected in the difference between 
the first registrations and the permanent registrations indicated by the data below. Similarly, because of British 
purchases of steamers from abroad (which might originally have been British built but which had temporarily 
passed out of British registry), additions to the British steam merchant fleet may in some years seem to be slightly 
larger than was warranted by current U.K. construction. For details of British ship registry practices see: Ernest 
W. Blocksidge, Hints on the Register Tonnage of Merchant Ships, Liverpool, 1942, pp. 9-13. 

2 The auth are indebted to the Cunard Steamship Company Ltd., for providing data on early Cunard 
Steamers which could be used in checking the accuracy of the transport capacity measures in Table 367 below, and 
for information on the obsolescence of early steamers. 
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predominant in new British steamship construction from at least 1851 on- 
wards; that the 1853 trade boom together with the stimulus of the Crimean 
War demand for shipping were decisive in introducing the iron-screw steamer; 
that the iron-screw steamer very largely supplanted both wood and iron 
paddle steamers by the end of the Crimean War. On the basis of this evidence 
including our measure of transport capacity it will be argued that contempo- 
raries were probably correct in their estimates of the extent of the steamship’s 
penetration of the shorter ocean-freight routes in the 1850’s and early 1860’s~— 
estimates which have been thought to be open to serious doubt. It is argued 
that Britain’s new steam merchant marine was a powertul influence in the de- 
velopment of Britain’s exceedingly strong balance of payments position in the 
third quarter of the 19th Century. 


2. THE DATA 


At the order of the 1861 session of Parliament a return was printed containing 
certain information covering all of the “Steam Vessels” permanently registered 
in the merchant fleet of the U.K. on or before January 1, 1861.* The following 
information is contained in this Parliamentary Paper: the length, breadth, and 
depth of hold of each ship is given, together with its gross and net tonnages, 
and the indicated horse power of its engines. The type of each ship is given, 
whether it was wood paddle, wood screw, iron paddle, iron screw or steel. 
Finally, the date of build, port of registry, and name of the registered builder 
(or builders) for each ship is stated. 

There are 1,945 steam vessels listed in this Silabais as “the whole” number 
of British steamers permanently registered in the U.K. from 1814 through 1860, 
with a total of 440,999 net and 692,101 gross tons.‘ The data are continuous 
from 1823 onwards and hence the analysis below concentrates on the period 
1823-1860. 

It is not difficult to understand why these data have not been utilized before, 
considering their organization—or rather, their lack of it. The ships were 
given numbers by 95 British ports of registry from London to Wexford. This 





3 Full title: “A RETURN, ‘in a Tabular Form, with Consecutive Numbers, of the whole of the Steam Vessels 
Registered in the United Kingdom on or before the Ist day of January 1861; stating in separate Columns, the fol- 
lowing Particulars:—Official Number of Vessel; Vessel’s name; Port of Registry; Date of Registry; Date of Build; 
Registered Owners; Dimensions of Vessels in Length and Breadth, and Depth of Hold; Tonnage (exclusive of Engine 
Room); and Gross Tonnage, distinguishing Vessels built of Iron and Vessels having Screw Propellers; also dis- 
tinguishing Vessels measured under the Merchant Shipping Act of 1854 from those measured under previous Acts; 
and Estimated Horse Power of their Engines, and giving the Aggregate Number of Vessels and Amount of Tonnage; 
with an Index for easy reference attached to it, giving the Names of the Vessels in Alphabetical Order, with Numbers 
to each, corresponding with the Consecutive Numbers in the Return (in continuation of Parliamentary Paper No. 
449, of Session 1869)',” Accounts and Papers, 1861, LVIII (371). 

‘ The “net tonnage” figures in Tables 363 and 367 below are the official “gross tounage” measur ts less ton- 
nage assigned to the engine rooms of the ships. Gross tonnage is generally “the measurement of the under-deck ton- 
nage, with the addition of the 'tween-deck spaces situated above the tonnage deck and all ‘enclosed’ spaces above the 
upper deck.” Blocksidge, op. cit., p. 7. Of the 1,944 ships included in Tables 363 and 367, 94 ships lacked entries in 
one or more of the categories “net tonnage,” “gross t ” “indicated horsep .” Estimates of the missing 
measurements were based on corresponding measurements for other ships built in the same year, or as nearly in the 
same year as possible (and if possible, ships having the same owners), and having the same dimensions and known 
tonnages and horsepower as the ships in question. It was felt that this procedure yielded more accurate estimates 
than would the standard procedure of using linear regressions. The error introduced by these estimates is no doubt 
an insignificant part of the total. Tables 363 and 367 include river steamers; these are not included in certain Statisti- 
cal Abstract data on steamships. Because these ships were all officially registered, they were of greater than 15 net 
register tons; ships of less than that tonnage (or 15 tons burden under the older British Registry legislation) need 
not have been officially registered. Blocksidge, ep. cit., pp. 9-10. 
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serial ordering begins with London, presumably because it had the greatest 
number of registries, and then proceeds alphabetically from Aberystwith to 
Yarmouth for 63 English and Welsh ports. The numbers continue from Aber- 
deen to Wigtown for Scottish ports and from Ballina to Wexford for Irish 
ports. A special index follows with the names of all the ships in alphabetical 
order. Each ship entry includes the official British registry number, but one can 
conveniently find an individual ship only if one knows its name; the registry 
numbers are not presented serially. No attempt was made to order the ships 
other than by port of registry. To use these data for different purposes involved 
their complete reorganization at each stage of analysis; 1,945 individual ships 
with at least 13 entries per ship—a considerable task if done by hand. Punch- 
card calculators were used by the present authors for the thousands of com- 
putations involved in organizing these data.® 


3. GROWTH OF THE BRITISH STEAM MERCHANT FLEET 1823-1860 


In Table 363 the data are organized in continuous annual series running 
from 1823° through 1860. By the end of 1860 a total of 1,945 steam vessels 
had been entered into permanent British registry with 692,101 gross tons.’ 
It is seen that the growth of the steam merchant fleet was slow until the early 
1840's, a peak was reached in 1841, another in 1847 and in 1850 there began a 
strong upward surge in additions to the fleet which reached its peak in 1855, 
the final year of the Crimean-War boom in shipbuilding. A substantial re- 
covery became apparent only in 1860. 

These movements in additions to the steam merchant fleet roughly corre- 
spond to changes in the data for total ships (both sail and steam) built and first 
registered in the United Kingdom. This may be seen in Chart 365 where data for 
the volume of foreign trade are compared to data for ships built and first 
registered in the U.K. The upward trend in foreign trade (and, ceteris paribus, 
in total freight revenues) no doubt accounts in large part for the accompanying 
upward trend in the tonnage of ships built and first registered in Britain. The 
rise in the trade data accelerates appreciably in the late 1840’s and in the 1850’s 
and is accompanied by a sharp upturn in shipbuilding. We will turn to this in 
detail below, since it was in this trade upswing that the iron steamer came into 
its own. The boom in shipbuilding centering upon 1840-1841 is evident in 
both the data for total ships built and registered and in those for steamers 
entered into permanent British registry (cf. Chart 365 and Table 363) except 
that the peak in total ships built and first registered comes in 1840 while the 
peak for steamers added to the British merchant fleet comes in 1841. Whereas 

6 The authors are indebted to the Computing Laboratory of Purdue University for assistance with the computa- 
tions. We would be giad to provide at cost a set of punched cards containing the original data to any scholar inter- 
ested in making further use of these data. 

* According to these returns a single steamer, the “Industry” was entered permanently into British registry in 
1814. The port of registry was Glasgow; the owner was A. McGeorge. This ship was 68 feet long, 17 feet wide, 8 feet 
deep, net tonnage was 43, gross tonnage 69, and indicated horsepower was 14. No other steamers appear until 1823 
when 4 were put under permanent registry. 

1 Including the adjustments in the data noted above. In 1860 the Statistical Abstract lists 2,000 as being the 
number of steam vessels “registered as belonging” to the United Kingdom and the register tonnage is given as 
454,327. The differences between these figures and those in Table 363 are easily accounted for. The Statistical Ab- 
stract data include vessels registered in the channel islands, whereas the data in Table 363 include only registry ports 


on the main islands (i.e., Great Britain and Ireland) and the Statistical Abstract data are register tonnages. The latter 
are significantly lower than gross tonnage estimates. See Blocksidge, op. cit., pp. 5-9 on tonnage measurements. 
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TABLE 363 


DISTRIBUTION OF STEAMSHIPS PERMANENTLY REGISTERED IN THE 
UNITED KINGDOM BY TYPE, NUMBER, NET AND GROSS TONNAGE, 
FOR EACH YEAR FROM 1823 TO 1860 






































Wood Steamships 
Year 
ath 4 Paddle Screw Total 
No. Net Tons Gross Tons No. Net Tons Gross Tons No. Net Tons Gross Tons 

1823 4 427 817 — -- — 4 427 817 
1824 4 341 682 _ = -— 4 341 
1825 5 419 894 _ _ 5 419 894 

826 11 964 1,831 ~- -— -~ ll 964 1,831 
1827 3 451 662 _ —_ —_ 3 451 
1828 3 63 141 _ = — 3 63 141 
1829 4 301 613 oa -- — 4 301 613 
1830 5 443 752 _ — _ 5 443 
1831 4 447 725 _ _ — 4 447 725 
1832 8& 910 1,491 _ _ _ 8 910 1,491 
1833 8 503 1,222 _ _ _ 8 503 1,222 
1834 17 2,350 4,042 _ _ _ 17 2,350 4,042 
1835 20 2,273 4,187 _ _ _ 20 2,273 4,187 
1836 18 2,468 4,449 _ _ _ 18 2,468 4,440 
1837 29 3,632 6,285 _ _ _ 29 3,632 6,285 
1838 31 1,789 3,547 _ —_ — 31 1,739 3,547 
1839 16 1,473 2,732 _ - - 16 1,473 2,732 
1840 23 ,936 5,572 1 292 355 24 3,228 5,927 
1841 27 9 ,265 14,644 _ _ a 27 9,265 14,644 
1842 22 4,734 8,465 _ _— — 22 4,734 8,465 
1843 27 1,547 3,223 _ - _ 27 1,547 3,233 
1844 23 2,481 4,631 _ _ _- 23 2,481 4,631 
1845 15 9 1,806 — 15 927 1,806 

26 2,027 3,719 1 163 240 27 2,190 3,959 
1847 47 4,396 8,735 - - — 47 4,396 8,735 
1848 46 2,506 5,399 _ : _ 46 2,506 5,399 
1849 22 2,177 3,815 -- ~ - 22 2,177 3,815 
1850 12 3,525 6,268 2 549 698 14 4,074 6 966 
1851 20 5,179 8,758 1 103 151 21 5,282 8,909 
1852 35 2,669 5,319 2 180 265 37 2,849 5,584 
1853 31 3,571 6,239 3 221 321 34 3,792 6,560 
1854 26 2,326 4,206 3 1,527 2,189 29 3,853 6,395 
1855 28 3,235 5,328 2 162 185 30 3,397 5,513 
1856 43 2,768 5,719 2 163 232 45 2,931 5,951 
1857 55 1,066 4,057 5 163 oe 1,229 4,370 
1858 33 2,245 4,710 2 64 112 35 2,309 4,822 
1859 39 771 2,619 5 858 1,182 44 1,629 3,801 
1860 35 1,096 2,769 5 195 275 40 1,291 3,044 
Iron Steamships 

Year 

of . Paddle Screw Total 
Buil 

7 No. Net Tons Gross Tons No. Net Tons Gross Tors No. Net Tons Gross Tons 
1823 _ = ~y ~ —_ ~~ _ _ _ 
1824 _ - - — — -- _ 
1825 - —_ “ -_ ~ — — — 
1826 — — -_ — ~- ~~ _ _ 
1827 _— _ ae — — _ _ - — 
1828 _ _ — =< — _ —_ —_ — 
1829 _ _ —_ one — — — - aa 
1830 — _ ~t — —_ _ _ — _ 
1831 _ _ — — _ - _ _ _ 
1832 _ _ — — _ — _ — _ 
as SS on vm “4 vs oe 
1835 _ _ _ i 368 473 1 368 473 
1836 2 194 284 _- -- 2 194 284 
1837 4 390 677 _- -- _ 4 390 677 
1838 5 174 298 _ _ = 5 174 298 
1839 5 5 559 _- —_ _ 5 5 559 
1840 7 576 947 1 1,734 3,509 & 2,310 4,456 
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TABLE 363—(continued) 









































Iron Steamships 
Year 
awns Paddle Screw Total 
No. Net Tons Gross Tons No. Net Tons Gross Tons No. Net Tons Gross Tons 

1841 971 1, - _ — 7 971 1,548 
1842 10 818 1,319 1 116 171 ll 934 1,490 
1843 6 1,105 — ~~ _ 6 669 1,105 
1844 29 4,025 6,947 3 255 400 32 4,280 7,347 
1845 26 3,126 5,483 3 424 651 29 3,550 6,134 
1846 17 4,589 7,233 10 2,716 3,803 27 7,305 11,036 
1847 31 7,919 12,270 6 1,777 2,458 37 9 ,696 14,728 
1 19 3,518 5,853 6 1,667 2,999 25 5,185 ,852 
1849 20 4,285 6,763 8 2,715 3,919 28 7,000 10 ,682 
1850 18 3,673 5,871 12 2,393 3,547 30 6 ,066 9,418 
1851 18 3,902 5,850 15 6,824 10,180 33 10,726 16 ,030 
1852 15 2,707 4,038 20 12,695 18,984 35 15,402 23 ,022 
1853 20 4,599 7,538 43 21,321 31,455 63 25,920 38 ,993 
1854 25 5,479 9,143 82 39,274 56 ,310 107 44,753 65 ,453 
1855 35 4,633 8,066 84 42,541 64,261 119 47,174 72 ,327 
1 32 6,350 10,273 7i 27 ,355 38 , 567 103 33 ,705 48 ,840 
1857 42 4,061 8,521 27 ,045 ,432 110 31,106 46 ,953 
1858 34 4,956 8,081 41* 29 nd 41,344* 73* 34 ,219* 49 ,425* 
1859 30 8,319 13 ,734 ash 15,546, 21,663t% 7st aan | 35,397 
1860 30 6 434 13 ,462 7 32,924 45, 4612 1 39, ,923 
x Total Wood and Iron Steamships 

ear 

of Paddle Screw Total 
Build 

No. Net Tons Gross Tons No. Net Tons Gross Tons No. Net Tons Gross Tons 

1823 + 427 817 _ ae 4 427 817 
1824 4 341 682 _ - — 4 341 682 
1825 5 419 894 _— —_ 5 419 894 
1826 ll 964 1,831 — — — ll 964 1,831 
1827 3 451 662 — — — 3 451 662 
1828 3 63 141 _ — — 3 63 141 
1829 4 301 613 — — _ 4 301 613 
1830 5 443 752 _ — — 5 443 752 
1831 + 447 725 —_ — — 4 447 725 
1832 8 910 1,491 oa — — . 910 1,491 
1833 8 503 1,222 _ _ — 8 503 1,222 
1834 17 2,350 4,042 _- — ~ 17 2,350 4,042 
1835 20 2,273 4,187 1 368 473 21 2,641 4,660 
1836 20 2,667 4,724 -- a a 20 2 ,662 4,724 
1837 33 4,022 6,962 -- -- -- 33 4,022 6,962 
1838 36 1,963 3,845 — _ _- 36 1,963 3,845 
1839 21 1,768 3,291 _ — 21 1,768 3,291 
1840 30 3.512 6,519 2 2,026 3,864 32 5,538 10 ,383 
1 34 10 ,236 16,192 —- — — 34 10,236 16,192 
184 32 5,552 9,784 1 116 171 33 5,668 ,955 
1843 33 2,216 4,328 —_ _ _ 33 2,216 4,328 
1844 52 6,506 11,578 3 255 400 55 6,761 ll, ’978 
1845 41 4,053 7,289 3 424 651 44 4,477 4 
1846 43 6,616 10,952 il 2,879 4,043 54 9,495 14,995 
1847 78 12,315 21,005 6 1,777 2,458 84 14,692 ,463 
1848 65 6,024 11,252 6 1,667 2, 71 7,691 14,251 
1849 42 6,462 10,578 8 2,715 3,919 50 9,177 14, "497 
1850 30 7,198 12,139 14 2,942 4,245 44 10,140 16/384 
1851 38 9,081 14,608 16 6,927 10,331 ' 54 , 24,939 
1852 50 5,376 9,357 22 12,875 19,249 72 18,251 28, 
1853 51 8,170 13,777 46 21,542 31,776 97 29,712 45,553 
1854 51 7,805 13 ,349 85 40,801 58 ,499 136 ‘ 71,848 
1855 63 7,868 13 ,394 86 42,703 64, 149 50,571 * 
1856 75 9,118 15,992 73 27 ,518 38,799 148 36, 54,791 
1857 97 5,127 12,578 73 27,208 38,745 170 32 ,335 1,323 
1858 67 7,201 127791 43 29 ,327 41,456 110* 36 ,528* 54 ,247* 
1859 69 9,090 16 ,353 5st 16,404} 22 ,845* 122t 25 ,494t 39 ,198t 
1860 65 7,530 16, "231 33,11 45,736" 1 40, 61 ,967R 











Accounts and Papers, 1861, LVIII (371). 


Source: 
nS Se eon t Eastern,” 13,344 net tons and 18,915 gross tons; and 1 steel ship, 79 net tons and 111 


oss tons 
° Includes 1 steel ship 129 net tons and 204 gross 
ships, 265 net tons Reece. 


Includes three s' 





FIRST 1,945 BRITISH STEAMSHIPS 365 


Cart 365 


i 
of 
a 


g 8 8 
8 
T 


ee wee | in, the UK (1000 tons) 
' 
trade 
; 


be puso ho Yes 


=] 
T 
oa -n £2 








arrestee ee er ee ee ee 
es sa awaaeaesitseisxneisrewrneat4 8s &6& TFT we SH SS SS SF 86 





the peaks in ship construction in 1847 and 1855 are easily accounted for (below), 
the 1840(41) peak in shipbuilding in Britain is not adequately accounted for in 
the literature and remains largely unexplained. It certainly does not conform 
to movements in other data of economic activity.® 

The information contained in Table 363 alone is sufficient to justify a rather 
sharp revision of accepted interpretations of the growth of the British steam 
merchant marine up to 1861. It is seen in Table 363 that some 60 per cent of 
all the gross tonnage of steamships added to the fleet in 1823-60 were added 
in the seven years 1854-60. Moreover, about 85 per cent of this gross tonnage 
consisted of iron ships and roughly 70 per cent of iron-screw steamers. As early 
as 1844 the gross tonnage of iron steamers exceeded that of wood® and by 1851 
the tonnage of iron-screw steamers alone exceeded total wood steamers (both 
paddle and screw).'° Hence the well-known view that “iron had not been 
generally adopted for steamers before the decade 1855-65”" is rather far from 
the mark. From 1851 onwards, the additions of iron-screw steamers to the fleet 
led all other types and by 1853 iron-screw steamers clearly dominated all others 
combined in the growth of the steam merchant marine. Considering the data 
in Tables 363 and 367 it ought to be noted that G. S. Graham’s recent state- 
ment 





* See Gayer, A. D., Rostow, W. W., and Schwartz, A. J., The Growth and Development of the British Economy 
1790-1850, Oxford, 1952, 1, pp. 276-303 for other economic data in the period. 

® 7,347 gross tons of iron steamers compared to 4,631 gross tons of wood. Table 363. 

1° 10,180 gross tons of iron-screw steamers, 5,850 gross tons of iron paddle and 8,909 gross tons of wood steamers 
of both kinds. ‘ 

u J, H. Clapham, Econemic History of Modern Britain, Cambridge, 1942, II, p. 63. Clapham’s discussion also 
suggests rather strongly, p. 64, that most of the steam fleet in the 1850's was still wood. L. C. A. Knowles places the 
beginnings of success for the iron steamship in the transport of passengers and cargo only “Between 1850 and 
1860.” The Industrial and Commercial Revolutions in Great Britain during the Nineteenth Century, London, 1926, 
p. 298. 
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“... iron proved to be a stubborn and uncertain medium, especially in the hands of 
craftsmen trained to handle timber, and by the mid-1850’s had superseded wood only 
in the construction of steam passenger or small experimental war vessels... ”!? 


should not be accepted without considerable reservation. Not only did the 
tonnage or iron steamers make up the great bulk of all steam tonnage added 
to the fleet “by the mid-1850’s,” but the average size (reflected in average 
transport capacity in Table 367 below) of wood-paddle steamers in the 1850’s 
was but a fraction of the average size of iron-screw steamers that comprised 
70 per cent of all the steamers added to the merchant fleet. Unless one argues 
that there are no ships except “steam passenger” ships in Tables 363 and 
367 (no war ships are in the data) Graham's statement needs revision. 

Rapid obsolescence was a characteristic of these early steamers, so that by 
1860 only 929 steam vessels (excluding river steamers) were employed in the 
coastal and foreign trade.“ The reasons for rapid obsolescence are obvious. 
As is usually the case“ in major transport innovations, changes in design, 
engines and construction came rapidly once the adoption of the basic innova- 
tion became widespread; i.e., once the shift to iron-screw steamers accelerated, 
and competition insured that the shift would in fact continue. The transport 
advantages of iron-screw compared to other types of steamships are clearly 
seen in Table 367 where the average transport capacity of iron-screw 
steamers was in general much greater by the mid-1850’s than that of any 
other type of steamer. 

Of course the rate of obsolescence varied from ship to ship, but it appears 
that few of the steam ships built before the 1850’s remained in service for 
much longer than a decade at most. The “Brittania” of 1840, the first steam 
Cunarder in regular trans-Atlantic service, was retired from that service in 
1849; and Cunard’s famous iron-paddle steamer “Persia,” launched in 1856. 
was retired from the Atlantic service in 1868, after a scant 12 years of service." 

Given the high rate of obsolescence and the fact that most of the British 
steam merchant fleet of 1860 had been built within a decade of that date, it is 
clear that by 1860 the British steam merchant marine was iron, and moreover, 
iron screw.’? The implications of this fact can be appreciated by inspection of 
the measures of transport capacity in Table 367 below—the earning capacity 
of this steam fleet was growing prodigiously. In fact, even though by 1860 
steamers accounted for only 10 per cent of total British register tonnage, the 
total freight earnings accruing to these steamers must have been much greater. 





2G. 8S. Graham, “The Ascendency of the Sailing Ship 1850-85,” Economic History Review (Second Series), 
Vol. IX, No. 1, August 1956, pp. 75-6. 

% Statistical Abstract. 

“4 Consider the experience of air and highway transportation, as well as the early history of rail transporation. 

8 See next section for a di ion of this measurement. 

% Private commuzication to the authors from the Cunard Steamship Company Ltd. It should be emphasized 
that some steamers in less competitive routes might enjoy a comparatively long service. By the end of the 19th 
century the rate of obsolescence slowed down significantly as designs and engines had passed through many ex- 
perimental stages. 

17 There were in fact already 5 steel ships in the registry by 1860 (Table 363). Clapham, op. cit., pp. 56, 62 found 
bis first steel steamships in the merchant fleet only in 1863-65. The data in Table 363 make it possible to avoid 
repetition of earlier errors concerning the size of Britain's early steam merchant fleet. For example, Gayer, Rostow, 
and Schwartz, op. cit., p. 286 cite G. R. Porter's Progress of the Nation to the effect that by 1842 some 4 per cent of 
registered British shipping was steam. This is obviously an overstatement. Total tonnage under British registry 
in 1842 was 2,991,000, 4 per cent of which would be 119,640 register tons; according to the data in Table 363 only 
about 70,000 gross tons of steam shipping had been added to the British merchant fleet by 1842, and much of that 
must by 1842 have been rendered obsolete and taken out of service. 
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TABLE 367 


RATIO OF TOTAL NET TO TOTAL GROSS TONNAGE, TRANSPORT 
CAPACITY PER SHIP AND TOTAL TRANSPORT CAPACITY FOR 


EACH YEAR FROM 1823 TO 1860 








Wood Steamships 





Screw 





Total 
Transport 
Capacity 


Total 
Capacity Transport 
Ship Capacity 





Bee68 2E38 


545 
-503 
-464 
-571 
562 
-691 
-502 


BE 883 


> 
N 
xn 





3% 


® 
NHONOS AWOSCe at 


o 
SBOON COONN BOe MOHD SOOM 


ue 


1 OM 
~ 
wo 


2g 


Rentets MOO oe 
<7 
SIZES Sessa See 
OOP NI eT wWwOOwrs CHrRNOCO COKOKR CHRON COOCOCNW NO 


to 





®NSQwS NOON 





tar 


Onno RON ® 


BS g5f8e 


Hews) 
85 


BSmom sow com mooce 
S8288 Sef8e S32 
cow nee 


SS588 
Ba 


nue ek oO 
Cam Ib 


e+e 


BS abies 
Sso8 S855 838 


Sennen BROCoR YOEBwWO CHRIS BHOR® SCBONN Bo 


Cato 

PPPS RAPS 
o w 
BSse § 


% 
3 5 


SEREE Sates 
$23 


see 


3 








Iron Steamships 





Paddle 


Screw 


Total 





Ratio of 
Net to 


Gross 
nnage 


Transport 


Capacity 
per Ship 


Total 
Transport 
Capacity 


Ratio of 
Net to — 
Gross PS ry 


Tonnage per Ship 


Ratio of 
Net to 
Gross 

Tonnage 


_—— 
‘apacity 
per Ship 











3,275.2 3,275.2 





778 


3,275.2 








Capacity 
per Ship 


Tonnage 








Total 
Transport 
Capacity 


Capacity 
per Ship 
Total Iron and Wood Steamships 


Gross 
Tonnage 


Ratio of 
Net to 








3 
= 
& 
: 
i 
© 
=) 
m 
= 
a 
< 
e 


Total 
Capacity 


it 
- 
Lal 


S528 


Transport 
Capacity 
Transport 














Ratio of 
Net to 
Gross 

Ton 
Ratio of 
Net to 

Gross 








Year 
of 
Build 
Year 
of 
Build 




















FIRST 1,945 BRITISH STEAMSHIPS 369 


By 1865 steamships accounted for some 20 per cent of the total register tonnage, 
but Sir John Clapham thought it possible that the steamers did more transport 
work than all the sailing ships combined.'* The argument applied (although to 
a lesser degree) to the late 1850’s as well; the transport superiority of Britain’s 
new steam fleet no doubt made important contributions to Britain’s balance 
of payments.'® To the extent that this was so, the rising trade deficit incurred 
from the 1850’s on,. ards was offset more easily year by year because of freight 
earnings, especially since the proximity of coal and iron to the sea, together 
with British shipbuilding skills, brought a decisive shift in comparative ad- 
vantage in shipping to Britain.”° 

Data exist which tend to support this view. It can re seen in A. H. Imlah’s 
estimates of the current account shown in his pioneering work on the British 
balance of payments. They show that the current account was especially 
strong (almost continuously in surplus from 1856 to 1874) in the third quarter 
of the 19th century, and that one of the main elements of this strength came 
from the sharp increase in net shipping earnings. If Clapham was correct, then 
the rapidly growing steam fleet (and its capacity to transport cargo, below) 
was no doubt an important contributor to Britain’s commercial strength in the 
period. 

By the late 1850’s the development of the iron-screw steamer had made it 
already clear that Britain would gain greatly in her share of ocean freight earn- 
ings in the last half of the 19th century. The growth of the tonnage of iron- 
screw steamers in the British steam fleet shown in Table 363 underlines this 


point graphically. In terms of transport capacity, Britain’s new steam fleet 
was even more impressive by 1860. 


4. THE MEASUREMENT OF TRANSPORT CAPACITY 


The customary measure of the cargo transporting capacity of a ship is its 
gross tonnage; the customary measure of the amount of shipping capacity 
built (or existing) in any year is the sum of the gross tonnages of the ships in 
question. 

The adequacy of this measure of the amount of shipping is vitiated in the 
present case by the fact that important and rapid changes in design and con- 
struction of ships and engines took place during the period covered. Compari- 
sons of the data for individual ships early and late in the period indicate that 
changes were made in hull design, evidenced by changes in the magnitudes and 
ratios of the linear dimensions of ships and in the changing composition of the 
fleet as between wood and iron hulls. Changes were made in engine design,” 





18 Op. cit., p. 72. 

19 Although the data in table 367 do not include a breakdown between ships employed in river or coastal 
trade, and those employed in ocean trade, it should be borne in mind that many of the smaller steamers were used 
only in river or coastal trade. Again, it is also possible that many of these steamers were used mainly for passenger 
service. It would be unwise therefore, to overemphasize the contribution made by the steamers to the balance of 
payments. 

2° Largely at the expense of the American merchant fleet. Note the failure of American construction of steam 
and motor ships to continue to grow, in terms of tonnage, from 1853 to the 1880's. Historical Statistics of the United 
States, Washington, 1949, p. 211. Also the sharp decline after the 1850's (beginning then) in the growth of American 
tonnage employed in foreign trade leading to absolute decline beginning in 1862. Ibid., pp. 207-8. 

2 “British Balance of Payments and Export of Capital 1816-1913,” Economic History Review (Second Series), 
Vol. V, No. 2, 1952. 

22 See Graham’s account, Passim; of changes in steam engines employed in ocean shipping. For an excellent 
account of the technical improvement of shipping during this period see Edgar C. Smith, A Short History of Naval 
and Marine Engineering, Cambridge, 1937, esp. pp. 94-186. 
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evidenced by systematic differences in the portion of gross tonnage occupied by 
engines and fuel for ships of given horsepower; changes were made i» propul- 
sion equipment, evidenced by the changing composition of the fleet constructed 
year by year in favor of screw propulsion at the expense of the paddle. In these 
circumstances a gross ton does not denote the same cargo transporting capacity 
for every ship. Thus, the changing composition from year to year of the fleet 
constructed means that an aggregate gross ton does not signify the same cargo 
transporting capacity in different years. In view of the rapid rate of technologi- 
cal advance in this period we are led to seek a more satisfactory measure of 
the amount of shipping constructed, where we view shipping as transportation 
capacity. 

The transportation capacity of a ship may be considered its ability to move 
cargo a given distance in a given time. More specifically, we can consider the 
cargo transporting capacity of a ship to be: 


Number of cargo ton — miles per hour = ( 


net eg a ‘gra 2) 
of the ship of the ship / 


We shall call this quantity the cargo transporting capacity, or more briefly, 
the transport capacity of the ship.* By summing the transport capacities of the 
ships built in any year we obtain a measure of the transport capacity of the 
fleet built in that year, which we may call the aggregate transport capacity. 
The validity of this measure of shipping capacity is not affected by changes 
in the composition of the fleet from year to year or by changes in the desigr of 
ships, because the significant consequences for cargo carrying capacities of 
such changes are measured directly by this quantity. Thus, changes in hw! 
design of ships produce changes in the space available for cargo, anc in their 
speed; changes in engine design similarly produce changes in the space avail- 
able for cargo and in speed; changes in design of propulsion equipment similarly 
affect cargo space and speed. (E.g., more efficient propulsion equipment makes 
it possible to reduce the size of engines and hence the space taken by engines 
and fuel without sacrifice of speed; or to increase speed without reducing cargo 
space.) 

Computation of the transport capacity in the present case was impeded by 
the fact that the official data do not include the speeds of ships. They do, how- 
ever, include the indicated horsepower of the engines and the linear dimensions 





® A fully satisfactory measure of transport capacity would include a measure of that portion of port time (time 
spent in docking, loading and unloading) attributable to the ship. Our presumption is that port time, like sailing 
time, decreased as the technology of the steamship developed and spread into use. To the extent that this was so 
the transport capacities shown in Table 367 tend to understate the levels anc. the rate of increase over time of the 
actual capacity of the fleet to transport cargo. The presumption that port tirse would be shorter for steamers than 
for sail is supported by the following considerations. 

Steamers represented substantial commitments of capital. Taylor estimates that “a steamship ordinarily 
cost three or four times as much as a sailing packet of comparable size.” Taylor, George R., The Transportation 
Revolution 1816-1860. Rinehart and Co., Inc., New York, 1951, p. 119. 

The large commitment of capital, the relatively smaller cargo space and the substantially larger proportion of 
port time to total time of transport for steamers would in the presence of the spur of competition, provide substantial 
inducements to reduce port time. Moreover the relatively rapid deterioration of iron steamers from rusting would 
provide an additional inducement in the same direction. 

We have only isolated pieces of information on port time, and none of it separates that part due to port facilities 
from that part due to the ship. However, what data we have tends to support the view that port time was less for 
steamers than for sail. This is clear in the case of the coal trade, where the introduction of steam colliers encouraged 
the introduction of power equipment at docks to facilitate loading and unloading (see p. 378, 379 below). 
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of each ship. Because the speed of a ship of moderate speed depends to a first 
approximation on the power available to drive it and on the resistance due to 
skin friction with the water, it was possible on the basis of this information to 
estimate the speed of each ship by means of certain conventional formulas to 
be found in standard reference works on naval architecture and marine engi- 
neering. The speed of each ship was estimated by means of the following 
formulas 


3 P 
V=C:- 4/ 2,000 — (1) 


S = L{1.5H + .73B} (2) 


In these formulas, P denotes horsepower, S, the wetted surface of the ship in 
square feet, V the speed, in nautical miles per hour, L the length of the ship 
in feet, H the mean molded draft (depth of hold) in feet, B the width in feet, 
and C a constant determined from data. 

The transport capacity of the fleet could then be determined from the esti- 
mates made of the transport capacities of the individual ships using their 
estimated speeds. 

Those considering using this measure of cargo-carrying capacity must settle 
for themselves the question whether it is preferable to use a measure correct in 
principle, but subject to an error of unknown magnitude analogous to an error 
of measurement, or to use an alternative measure, which is in principle un- 
satisfactory, but whose value is perhaps known more precisely.” 

The Cunard Steamship Company was generous in making available the 
speeds of some twenty ships built during the period 1845-1861. 

Trial computations for these ships led to the decision to divide the ships 
into two groups, those built prior to 1848, and those built after 1848. For those 
prior to 1848 the value C=1.11 was used, for those after 1848, the value 
C=1.39 was used. The following table shows the estimated and actual speeds 
and the error for the ships for which the speed is available. 

The measures of transport capacity obtained with the aid of these formulas 
are given in Table 367. This table also shows the ratio of net to gross tonnage 
in each year (in effect a weighted average of the ratio of net to gross tonnage of 
each ship weighted by gross tonnage), and the transport capacity per ship in 
each year. These data enable us to compare the fleet measured by transport 
capacity with the fleet measured by gross tonnage. 





% The formulas are taken from Taylor, D. W., The Speed and Power of Ships; a Manual of Marine Propulsion, 
1933, pp. 171, 20-21 respectively. 

Formula (1) is obtained from Kirk’s Formula by taking the constant & =4, the figure suggested by Kirk for 
ships with a high coefficient of fineness. 

Formula (2) is obtained from Normand’s formula by taking the block coefficient equal to .65, the value given 
by Hughes, Charles H., Handbook of Ship Calculations Construction and Operations, 8rd Edition, New York and 
London, 1942, p. 170. See also Machrow, Clement, and Woollard, Lloyd, The Naval Architect's and Shipbuilder’s 
Pocket-book of Formulae, Rules and Tables and Marine Engineer's and Surveyor’s Handy Book of Reference, Fourteenth 
Edition (Revised), pp. 175, 176. 

Judgements involving the shapes of ships, based on examination of these data, on their linear dimensions and 
on such illustrations of early steamships as were available to the authors, were used as a basis for selecting the values 
of the two constants, &, and the block coefficient, used in these formulas. The arbitrary nature of this method of 
choice was partly offset by the introduction of the constant C, which was estimated from numerical data, see below. 

% These hard alternatives may be softened by the fact that there is some information about the errors due to 
estimating speed. See Table 372 below. 
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TABLE 372 


ACTUAL, ESTIMATED SPEED, AND ERROR FOR VARIOUS SHIPS 
BY YEAR OF BUILD 








Actual Speed in Estimated 
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Source: Private Communications to the authors from Cunard Steamship Company, Ltd. 


In Chart 373 we plot the logarithms of gross tonnage and transport 
capacity, respectively, against time. These series move closely together, except 
perhaps for the period 1848 to 1854. 

Linear regressions of these series on time yield the least squares trend lines 
shown in Chart 373. It is seen from these that the original series have ex- 
ponential trends, and that transport capacity increased during this period at a 
faster rate than did gross tonnage. The method of estimating the speeds of the 
ships affords grounds for the belief that the computed transport capacity may 
overstate the capacity of ships built early in the period and understate the 
capacity of those built late in the period; so that the rate of increase of trans- 
port capacity is lower than it should be. 

The equations of the trend lines are: 


estimated gross tonnage = 2.63 — .061 X (time) 
estimated transport capacity = 3.26 — .067 X (time) 


The data in Table 367 sharply amplify the preponderence of iron steamers 
in the total fleet; and also indicate some advantages of screw propulsion over 
paddle. By the 1850’s iron steamers had outstripped wood both in total and in 
average transport capacity. Wood-screw ships did not consistently exceed 





%* The squares of the standard errors of estimate for these regression lines are .06 and .07 respectively. 
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wood-paddle steamers in transport capacity, but did in terms of the ratio of 
net to gross tonnage—indicating the larger amount of free cargo space which 
screw propulsion made available compared to paddle. It is notable that, in this 
respect, wood-screw ships were superior to iron paddle ships and even compared 
favorably to iron-screw steamers. Nevertheless, as is seen in Table 367, very 
few wood-screw steamers were placed under permanent registry in this period. 
This evidence supports the view that wood had disadvantages for ocean 
transport for which the compensation of screw propulsion and the advantages 
of long familiarity in construction were insufficient. 

By the 1850’s the highest total and average transport capacities had clearly 
passed over to iron-screw steamers. The sharp decline in both the total and 
average transport capacities of the wood ships after the mid-1850’s probably 
indicates that the wood steamer was increasingly used only for river transport, 
and new wood steamers were no longer being purchased for employment in 
the ocean shipping routes even though a considerable number of wood-paddle 
steamers were still being placed under permanent registry (Table 363). 

The transport capacity measures in Table 367 together with the data in Table 
363 suggest some significant points about the role of the steam fleet compared 
to the sailing ships that still comprised the bulk of British merchant ships 
under British registry in the mid-19th century. As was argued above, the freight 
earnings from the new steam fleet doubtlessly added considerably to Britain’s 
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ability to reverse the pattern of her balance of payments and to run almost 
persistent current-account surpluses from the mid-1850’s until the mid-1870’s. 
These earnings no doubt came mainly from the shorter trade routes. The actual 
conquest of the shorter trade routes by British steamers has not yet been meas- 
ured in quantitative terms (or dated), but on the basis of known historical 
evidence it is thought that the conquest was extensive by the late 1850’s.”’ 
Even so, there has not been general agreement on the actual degree to which 
steamers supplanted sail in the mid-i$th century. Sir John Clapham, for ex- 
ample, thought it a “great exaggeration” when C. M. Palmer stated in 1864 
that “iron steamers [did] most of the carrying trade of the Baltic and Medi- 
terranean.”?* But the data adduced in this paper clearly show that this need not 
have been an exaggeration at all. Indeed, the shoe would seem to be on the 
other foot. We have seen that by the end of the 1850’s the carrying trade, if it 
were being done by steamer (and Clapham never argued that it was still de- 
pendent upon sail) must have been done by iron steamers, and indeed, by iron- 
screw steamers—all other kinds of steam ships taken together having been re- 
duced to very small proportions of the British steam merchant fleet in the 
1850's. 

In a similar vein, G. 8. Graham has recently argued against what he con- 
siders to have been the undue emphasis placed by historians upon the role of 
the steamship in the 19th century development of the British empire.?® The 
point may be well taken, but it needs quantitative support. If by “Imperial 
development” one means British overseas investment in the colonies, then the 
steamer, by its freight earnings in the North Sea, Baltic, Mediterranean and 
North Atlantic routes, was without doubt an important contributor to the 
strong balance of payments position which sustained British overseas invest- 
ment in the last half of the century. Graham makes much of the inability of 
the steamers to compete with sail on the long routes to India, China, and 
Australia. But considering that more than half of British overseas trade was 
condueted within the possible steamer routes (Europe accounted for between 
30 and 45 per cent, North America for another 10-30 per cent*® and this cargo 
trade data does not include the very profitable emigrant passenger trade in 
which steamships participated), and considering that the carrying trade of 
Europe was within the sphere of British steamers, it would seem that it must 
be shown in quantitative terms that freight earnings from the long-haul cargos 
of sail were greater than steamer earnings if Graham’s point is to be substan- 
tiated.** Until such evidence is forthcoming it would not be wise to write off 
the “traditional” emphasis on the role of steam. The data in Tables 363 and 
367 above show that by the end of the 1850’s Britain’s steam merchant fleet 

" E.g. Graham, op. cit., p. 74. Throughout this account there is scattered evidence on this problem. G. R. 
Taylor states that “ . . . steamships had, during the fifties, been able to take over the best-paying part of the ocean 
trade.” He believes that in total world trade, by 1860, the sailing ship still prevailed “very largely.” The Transporta- 
tion Revolution, New York, 1951, p. 119. 

%8 Op. cit., p. 71. 

% Op. cit., pp. 74-75. 

%° See Werner Schlote, British Overseas Trade from 1700 to the 1930's, Oxford, 1952, Tables 18, 19, pp. 156-159. 

™ And this may be conceivable if data for freight earnings can be uncovered, since bulk cargos of the kind 
carried by sail may well have earned more freights (in aggregate) than manufactured goods and passengers. The 


authors are indebted to Douglass C. North for this observation based upon his as yet unpublished investigation of 
19th century ocean freight rates. 
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was not the wooden assortment of paddle wheelers that many have envisaged 
(above), but that it was mostly iron-screw, of recent build and growing with 
increasingly greater transport capacity. It was a commercial weapon worthy 
of the importance assigned to it by its contemporaries. 


5. THE HISTORICAL SETTING 


New techniques have only rarely affected major changes in economic activity 
with what might be called startling rapidity. The case of the iron-screw steam- 
ship was no exception. Indeed, by the mid-1850’s when the triumph of this 
kind of ocean transport in the steamship routes was obvious to contemporaries 
(and visible on the ways of British shipyards) the steamship already had a 
history stretching back half a century. The iron-screw steamer had been in 
use for at least two decades, and a great and colorful age of commercial sail 
on the longest ocean routes still lay in the future.* Yet, given the history of 
experimentation and growth in earlier decades, the rapid increase in the num- 
ber and tonnage of iron-screw steamers in the British merchant marine in the 
1850’s together with the resulting displacement of other forms of ocean trans- 
port, constitute a striking development; and was certainly an outstanding 
example of the force of innovation in the record of British economic growth in 
the period. 

The accelerated introduction of the iron-screw steamer in the 1850’s can 
best be understood in its own historical context. The demand for iron-screw 
steamers depended mainly upon the needs of overseas trade. A major innova- 
tion such as this one might successfully make inroads upon the markets of 
older and less efficient kinds of shipping because of cost advantages, but a rapid 
expansion of the total market would considerably ease the problem of entry, 
especially if existing shipping capacity were very nearly utilized. This would 
presuppose a strong growth of foreign trade, which in turn would, ceteris 
paribus, be related to a sharp rise in domestic and foreign economic activity. 
Such conditions prevailed in the decade of the 1850’s. Moreover, the rise of 
a competing form of transportation, the railway, threatened the coastal trade 
in coal and virtually forced the quick adoption of the iron-screw steamer in the 
coastal transport of coal. Here was a process familiar in economic history— 
innovation giving rise to innovation on a grand scale.* 

The British economy evolved slowly from its mainly agricultural base 
toward an “industry state” in the half-century following the end of the French 
wars in 1815. The growth of industry and trade was uneven, but with the 
steady increase in population, internal improvements in industrial techniques, 
transportation, and communications together with the increasing movement of 
laborers from the countryside into the industrial towns,™ the transformation 





* Gerald S. Graham, op. cit., p. 75. 

% For an extensive discussion see, for example, Paul Mantoux’s discussion of the effects of Kay's flying shuttle 
on the textile industry, The Industrial Revolution in the Eighteenth Century, London, 1949, pp. 211-3; and Kar! 
Marx, Capital, Vol. I (Glaisher, ed.) London, 1918, pp. 379-82, on the effects upon invention of disparate increases 
in productivity in related industries caused by innovations; e.g., . . . it was only during the decade preceding 1866, 
that the construction of railways and ocean steamers on a stupendous scale called into existence the cyclopean 
machines now employed in the construction of prime movers.” p. 380. 

* This movement was at its height in the 1840's; see A. K. Cairncross, “Internal Migration in Victorian 
England,” Manchester School, 1949, vol. xvii, p. 70. 
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became ever more perceptible. The outflow of British people (and other Euro- 
peans) to America and to the colonies* insured a growth of British overseas 
trade in industrial products. This growth was augmented by commercial de- 
velopment elsewhere and by the ever-growing interdependence of commercial 
and financial activity on a world scale. In a very real sense, trade engendered 
more trade. At the center of this complex of markets the growing British in- 
dustrial economy became increasingly dependent upon overseas supplies of 
raw materials, and (by the mid-19th century) of food. 

The effects upon Britain’s overseas trade are to be seen in Chart 365. Even 
before the great upsurge of economic activity that characterized the decade 
of the 1850’s, the movement of British foreign trade was strongly upward in 
volume terms. This no doubt provided enough of a growth element in the 
ocean-freight market to buoy up the expectations of investors in steamers. 
Thus in the late 1830’s and twice in the 1840’s there were fairly marked in- 
creases in the construction of steamships, either wood paddle or iron paddle, 
to be added to the British merchant fleet. But as can be seen in Table 363 it 
was in the years immediately after the Californian and Australian gold dis- 
coveries and during the Crimean War that the character of steamship con- 
struction changed radically and the iron-screw steamer clearly displaced wood 
and iron paddle steamships in United Kingdom shipyards.* 

While the rapid growth of iron-screw steamer construction is evident as 
early as 1849, the years 1850 to 1855 saw a spectacular increase in their con- 
struction and the introduction of the iron-screw steamer was decisive. The 
boom in overseas trade (especially raw-material imports and also exports of 
British commodities to America and Australia up to 1854, together with a 
sharp rise in exports to the Continent and to the Middle East in 1855) followed 
by military transport requirements while trade remained at very high levels, 
led to a strong demand for new shipping capacity, and during the war, especially 
for steamers.*? 

The rise in ocean freights even before the outbreak of War with Russia in 
1854 was very marked and spurred the efforts of shipbuilders and ship specula- 
tors. From the first quarter of 1852 to the third quarter of 1853 average cargo 
freight rates from British ports to New York rose from 15s to 28s per ton. 
During the same period, freights on (measurement) goods to Calcutta rose 
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% From 1845 to 1850 alone roughly 1.3 million people emigrated from the United Kingdom. Statistical Abstract. 

%* The change to iron was marked in sailing vessels as well. “As regards iron-built sailing vessels, the most re- 
markable feature of our trade is the very increasing favor they are now growing into, and which are now occupying 
the builders both here [Liverpool], on the Clyde, Newcastle and elsewhere to an unprecedented extent.” Circular to 
Bankers, January 8, 1853. By the middle 1850's a shrewd and observant contemporary, Thomas Tooke, without 
distinguishing between kinds of iron ships, saw clearly the change that had taken place. “The number of Iron Vessels 
now owned by this country is very great, and it is rapidly on the increase. On the Clyde it is a rare sight to see a ship 
built of wood. The Tyne-built are, also, almost all of iron. At Liverpool also, iron is becoming much used; and even 
at Sunderland, where they have been so long accustomed to wooden vessels, some of the leading builders are directing 
their attention to iron.” History of Prices, London, 1857, Vol. V, p. 331. 

3? The striking rapidity and magnitude of the response of the shipbuilders to the stimulus of increased demand 
was partly made possible by the flexibility of the Sritish financial system of the period. Ships were financed on the 
basis of long-dated bills of exchange which were accepted and di ted like ordinary trade bills in the London 
discount market. The system was largely discredited in 1866 when the great discount house of Overend and Gurney 
failed and their underwriting, by this means, of the notorious ship speculator, Stephanos Xenos, became known. 
In Accounts and Papers, 1861, LVIII (371) Xenos appears prominently from the mid-1850’s onwards in the London 
registrations. 
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from 22s to 42s per ton, and to Melbourne, from 42s to 85s per ton.** Such 
increases of as much as 100 per cent reflect the shortage of shipping capacity 
that existed in the wake of the trade expansion of the early 1850’s. But when 
war came early in 1854 freights rose even higher in response to government 
demand for troop transports to the Black Sea. . 

Immediately following the outbreak of war government transport demand 
had been mainly for sail, but several large steamers were chartered from the 
principal steamship companies and these ships soon proved their superiority. 
They were mostly of large size*® and made the voyage from Portsmouth to 
Constantinople in from 12 to 15 days. The same trip for sail was an estimated 
40 to 50 days.*® Accordingly government charter demand swung over to 
steamers: there were 31,900 tons of steam under government charter by June 
1855 compared to only 15,800 tons of sail; a year earlier these figures had been 
virtually the reverse, 14,000 tons of steam and 39,800 tons of sail.“! By the end 
of 1854 the government was paying 55s to 60s per register ton for steamers; 
25s to 30s had been the charter price at the beginning of the war.” 

There developed a shortage of steamers in almost every route when govern- 
ment demand was met. 


. . . though every line of the trade has been drained, even to the derangement of our 
postal services, sufficient steamers cannot be found.* 


By late 1854 it appeared that British shipping capacity was strained virtually 
to the limit. Wages of shipbuilders and seamea by 1854 had risen 50 per cent 
or more compared to 1850.“ The rise in shipbuilders’ wages reflected both 
private and government demand for newly constructed ships. Of the 354 
steamships added to the British Navy from 1848 to 1859, 284 were purchased 
ir. the three years 1854-56.“ As can be seen in Table 363 the increased tonnage 
of steamer construction (and especially of iron-screw steamers) in the early 
1850’s reached boom proportions during the Crimean War years. During that 
period the market seemed to warrant a great expansion of steamship con- 
struction—and especially of iron-screw steamers.“ But the Crimean War ship- 
building boom largely accounted for the depression in shipbuilding after 1855; 
excess capacity had been created, and especially was this the case when the 
war ended. 





% Tooke, op. cit., p. 319. 

39 Averaging well over 1,000 gross tons, the largest, the Himalaya, chartered from the Penninsular and Oriental 
Steam Navigation Company, had a gross tonnage of 3,550. Ibid., p. 320. 

40 Thid. 

“ Ibid. 

4 Tbid., p. 321, 

# Times, January 4, 1855. 

« The peak of wage rates reached during the Crimean War represented the highest levels of the whole period 
1848-1860; wages declined from 1855-56 onwards. See Bowley, A. L. and Wood, G. S., “Statistics of Wages in the 
United Kingdom . . . Engineering and Shipbuilding,” Journal of the Royal Statistical Society, 1906, Vol. LXIX, 
p. 518; for wages of seamen, see Accounts and Papers, 1867, LXIV (158). 

Accounts and Papers, 1859 (Sess. 1), XV (180). For total vessels ordered by the Navy and launched in this 
period see Accounts and Papers, 1860, VIII (545), Report of the Select Committee on Navy Guns and Mortar Boats, 





Appendix 3. 
4 Moreover, the steamers constructed were mostly of large size suitable for overseas trade, while the coasta 
trade was neglected. Steam t ge employed in coastal trade fell from 85,500 in 1853 to 54,000 in 1854, and was 


only 57,400 in 1855. Times, January 4, 1955; Statistical Abstract. This no doubt helped to shift the transport of coal 
to the railways during the War years and especially in 1855. See below pp. 378-380, especially Table 379. 
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As can be seen in Table 363, the decline in shipbuilding in the years im- 
mediately following 1855 was sharp—and this in spite of the unprecedented 
levels of trade ruling in 1856-1860. Here the yards specializing in the construc- 
tion of iron-screw steamers were evidently not protected by the technological 
superiority of their product; they were hit as badly by the excess shipping 
capacity built during the Crimean War as were those shipyards still specializing 
in sailing ships.*7 However, even though new construction of iron-screw steam- 
ers was drastically reduced, the superiority of these ships over sail was becom- 
ing increasingly evident and the war-built steamers were evidently capturing 
the shorter trade routes as early as 1856. 


“Surplus steamers are now finding their way, to the advantage of all concerned, into 
trades formerly carried on by sailing vessels.“ 


The Baltic routes especially felt the impact of the Crimean War iron-screw 
tonnage. On these routes the fuel problem was less of a handicap than on other 
routes and by 1856 the intrusion of steamers was a notable feature of the 
Baltic trade. In 1853, 26 steamers had reportedly made 79 voyages from 
British to Baltic ports; in 1856 the number of steamers in that trade had risen 
to 136 and the number of voyages to 334.** Even as early as the beginning of 
1857 some shipowners realized that steam was about to overtake sail in the 
shorter routes.*® By the end of 1858 it was noted that steamers had succeeded 
in the Mediterranean trade." In the coastal trade railway competition had 
already assured the introduction of steam. 

The construction of trunk lines associated with the railway “mania” of the 
1840’s together with the proliferation of branch lines during the 1850’s pro- 
vided Britain with a vastly improved land transportation base for industrial 
expansion. But the new railways also broke the bottleneck in coal transport, a 
field that had formerly been the preserve of the coastal-shipping companies, 
and provided strong competition for the freight earnings of coal haulage. De- 
velopment of a rapid internal transportation system capable of handling coal 
meant that the old wind-driven colliers faced a kind of competition they had 
never known before. Moreover, coal-producing areas not served by direct rail 
connections with London were faced with potential losses of a good part of 
their market in the metropolis and were forced to find a means of retaliation. 
Iron-screw colliers provided the means of competing. 

From 1850 to 1860 coal transported into metropolitan London rose from 
3,639,000 tons to 5,070,000 tons, an increase of 1,431,000 tons. During the same 
period, coal brought into the London district by inland navigation and railways 
rose from a mere 85,000 tons to 1,497,000 tons, or by 1,412,000 tons, almost 
the whole of the increase in the total.” By 1860 coal brought into London by 
coastal shipping stood at 3,573,000 tons, a mere 20,000 tons above the 1850 





47 The shipowners were well aware of their predicament. “In estimating the prospects for the current year, we 
cannot hide from ourselves that we have far too much tonnage afloat for the purposes of our legitimate trade.” 
Times, January 3, 1857. 

48 Tooke, op. cit., Vol. V, p. 331. 

4 Times, January 3, 1857. 

5° “In our opinion the time is rapidly approaching when for short voyages steam will be as a general rule 
substituted for wind.” Ibid. 

© Tbid., January 3, 1859. 

8 Report of the Commissioners Appointed to Inquire into the Several Matters Relating to Coal in the United King- 
dom, Accounts and Papers, 1871, XVIII (C. 435-II), Appendix to Report of Committee E., p. 975. 
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figure, but in 1855 coal brought into London by coastal shipping had fallen to 
just above 3,900,000 tons; at the same time inland transport had risen by over 
one million tons to 1,161,000 tons.* From 1855 to 1860, as is apparent from 
these figures, the coastal trade regained some of its earlier losses to the railways; 
coal brought in by coastal shipping increased more than that brought in by in- 
land transportation. This change in the fortunes of the coastal trade was due 
almost entirely to the introduction of iron-screw colliers. 

The Northern shippers and coal-owners had been fairly quick to perceive the 
advantages of iron-screw colliers in spite of a suggestion to the contrary impli- 
cit in the volume data shown above. As early as 1844 interest was aroused in 
London on the arrival of a Newcastle collier “. . . a fine looking vessel built of 
iron and fitted with a screw propeller, working an engine of 20 horse power.” 
By the 1850’s the beginning of the shift of the English iron industry’s center to 
the North Riding of Yorkshire, following the successful processing of the 
Cleveland “yellow band” iron ores, produced a powerful group of new indus- 
trialists who quickly adopted the iron-screw steamer. One of these men, C. M. 
Palmer, a Northern coal and iron owner as well as a shipbuilder, launched the 
“John Bowes” in 1852. This ship was an iron-screw collier built to carry 650 
tons of coal at 9 miles per hour. On its initial voyage, the “John Bowes” went 
from Newcastle to the Thames in 48 hours, unloaded in 24 hours, and was back 
in Newcastle in another 48 hours, having done a job in five days that “ . . . would 
have taken two average sized sailing colliers upwards of a month to accom- 
plish.”* As can be seen in Table 379, these results were not lost on Northern 
shipowners. 


TABLE 379 


NUMBER OF CARGOES AND TONS OF COAL CARRIED BY IRON-SCREW 
COLLIER, NEWCASTLE TO LONDON BY YEARS FROM 1852 TO 1862 

















Year Number of Cargoes Tons of Coal 
1852 17 9,483 
1853 123 69 , 934 
1854 345 199,974 
1855 174 85,584 
1856 413 238 ,597 
1857 977 547 ,099 
1858 1,127 599 ,527 
1859 899 544,614 
1860 1,069 672 ,476 
1861 1,299 851,991 
1862 1,427 929 ,825 





Source: J. 8. Jeans, op. cit., p. 75. 


The hundred-fold increase in coal shipments by iron-screw collier in a matter 
of a decade was not only a consequence of increased production and use of these 
steamers, but was also a result of rapid technical improvements in such ships. 





8 Ibid. 
% Harry Scrivenor, History of the Iron Trade, London, 1854, p. 311. 
% J. 8. Jeans, Notes on Northern Industries, London, 1879, pp. 74-5. 
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In 1857 an announcement appeared that the “William Cory” was ready for 
service between Cardiff and London. The explicitly stated purpose for building 
this ship was to compete with railway transportation. The “William Cory” was 
designed to carry 1,500 tons of coal—more than twice the capacity of the 
“John Bowes.” The “William Cory” was also fitted out to take on water as 
ballast for its homeward voyage and the Victoria Docks had been supplied with 
hydraulic equipment that would take off the entire cargo in 16 hours. Given 
the pace of technological advance implied by these two illustrations,*’ the data 
in Table 379 are easily enough understood. Although the iron-screw collier 
might have developed rapidly enough on its own, the vigorous railway compe- 
tition of the 1850’s accelerated the introduction of this new form of ocean 
transport. 

The growth of the new iron-steam fleet in this period had broad economic 
implications. The construction of the new iron-steam fleet (and also the adop- 
tion of iron for sailing ships) provided an additional source of demand for iron 
and underpinned expanded investment and output in the iron industry and in 
the closely connected coal industry in the 1850’s.** In this respect the demand 
effects of iron-ship construction on the iron industry were similar to those ob- 
served by Clapham in the period 1873-1883.5* The shift of iron production 
toward the North East during the 1850’s was thus no doubt given an additional 
stimulus by shipbuilding booms in the shipyards of the Tyne and Tees, and in 
Glasgow and other iron ports in the area. The number of steamer registrations 
in the North-East ports in fact rose rapidly from 1851 onwards.® 

The trends in transport capacity and gross tonnage added to the British 
steam fleet 1823-1860 (above p. 372 and Chart 373) are of interest in connec- 
tion with certain theories of economic development. From the time of Malthus 
theorists have speculated on the possibilities of exponential growth, and re- 
cently attention has again turned to growth models characterized by exponen- 
tial growth." If these models can be regarded as referring to average behavior 
of output, then the observed growth of transport capacity (Chart 373) is 
consistent with the hypothesis of exponential growth for the period 1823-1860. 

Converting the regression equation for transport capacity into the form of 
an exponential to the base “e” we have: 


Estimated transport capacity in year t = e&-4'+7-) 


Transport capacity thus grew on the average at the rate of 15 per cent per year 
during the period observed. It is beyond the scope of the present paper to de- 
termine to what extent the growth of transport capacity is representative of 
over-all economic growth in the period. But the transport capacity data cer- 





& Times, July 16, 1857. 

87 A further discussion about the rate of obsolescence in these early steamships is given above. 

58 A brief discussion of iron and coal output and investment in the 1850's will be found in J. R. T. Hughes, 
“The Commercial Crisis of 1857,” Ozford Economic Papers (New Series), Vol. 8, No. 2, June, 1956, pp. 196-7, 204-5. 
For Scotland alone, R. H. Cambell, “Fluctuations in Stocks,” Oxford Economic Papers (New Series), Vol. 9, No. 1, 
February, 1957, p. 43. 

8 Op. cit., p. 61. 

6 With the exception of Sunderland. 

® See for example; von Neuman, J., “AModel of General Economic Equilibrium,” Review of Economic Studies, 
Vol. 13, No. 1, 1945-46, pp. 1-9; Solow, R. M., and Samuelson, P. A., “Balanced Growth under Constant Returns 
to Scale.” Econometrica, Vol. 21, No. 3, July 1953, pp. 412-24. 
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tainly help to raise the possibility of exponential growth out of the realm of 
pure hypothesis; and a cursory examination made by the present authors of ag- 
gregate data for the British economy in the first half of the 19th century sug- 
gests that further research in this area will yield high returns to interested 
scholars. 

The historical background of the boom in investment in steamers in the late 
1840’s and in the 1850’s can add considerably to our understanding of the rapid 
expansion of this means of ocean transport in the period and contributes to our 
understanding of British economic growth in the period. But there remain many 
unanswered questions; certainly until adequate quantitative evidence on the 
actual competitive performance of these ships in the ocean-freight markets is 
forthcoming, the development of the ocean-going steamer in this early period 
will not be fully understood. What were the actuai cost incentives of iron-screw 
steamers compared to other types in this period? How much longer than wood 
ships did the iron steamers remain in service? How long did the growth patterns 
indicated in Tables 363 and 367 above continue after 1860? When did the 
new steel ships begin to displace iron in the new steam fleet? These are questions 
which ought to be profitable areas for further investigation. 


6. SUMMARY 


Beginning in the late 1840’s the growth of the British steam fleet began to 
accelerate markedly and this fleet was mainly built of iron. As early as 1844 the 
gross tonnage of iron steamers added to the fleet exceeded wood. By 1851 the 
gross tonnage of iron-screw steamers alone exceeded all wood steamers. Hence 
the “general” adoption of iron for steamships certainly began in the early 
1840’s, or from 10 to 20 years earlier than has customarily been believed. Some 
60 per cent of the gross tonnage of all steamers added to the fleet in 1823-1860 
was added in 1854-60, 85 per cent of this gross tonnage was iron, and about 70 
per cent was iron-screw. Given the high rate of obsolescence of these early 
steamers, it has been argued that by 1860 the British steam merchant fleet was 
mostly iron-screw and less than a decade old. It was shown that due to techno- 
logical improvements the rate of growth of the transport capacity of this fleet 
was even faster than the gross tonnage, and that iron-screw ships were pre- 
dominant by these measures, and were the most efficient in terms of net cargo- 
carrying capacity. It has been argued that, in the light of these findings, his- 
torians have probably underrated the strength of the British steam fleet which 
existed by the early 1860’s, and that the claims of contemporaries, thought by 
recent historians to have been exaggerated, were probably justified. At least 
there is little quantitative evidence to the contrary. Finally, it has been shown 
that the rapid development of the steam merchant fleet in the 1850’s was 
associated with strong economic expansion both within Britain and in her over- 
seas trade. The growth of the steam fleet was further buttressed by the Cri- 
mean-War transport demand and was given an additional stimulus by the 
growth of railway competition for the coastal trade in coal. Hence a favorable 
economic environment prevailed in Britain for the accelerated introduction of 
this radical innovation in ocean transport. 





THE USE OF RANDOM WORK SAMPLING FOR 
COST ANALYSIS AND CONTROL* 


A. C. Rosanper, H. E. Gurerman, ano A. J. McKeon 
Internal Revenue Service, Washington, D. C. 


Random time sampling of activities (work sampling) is successfully 
applied to everyone in a division of 350 employees for the purpose of 
budget planning and cost control. To insure this success several techni- 
cal and administrative problems had to be solved, especially those re- 
lating to the design and management of the sample. An appropriate 
model is described together with methods of sampling, estimating, and 
calculating sampling errors. Bias is reduced by using an employee in- 
struction booklet, anonymous reporting, and by classifying and coding 
activities. Cost of the plan is reduced by using a stratified cluster 
sample, a check-type data sheet, a simplified method of estimation, and 
by calling random times by telephone. 


PURPOSE 


HE purpose of this paper is to show how probability sampling can be ap- 

plied to work and nonwork activities in order to obtain data for cost analy- 
sis and cost control. Methods are described which show how estimates of cost 
can be made directly from the sample. This procedure goes beyond the usual 
practice followed in ratio delay studies or work sampling where estimates are 
limited to percentages. Techniques applicable to office or service operations are 
described in detail and illustrated by an actual study. Several unique factors 
not heretofore used in work sampling are described in detail; one of these is 
estimating directly from the sample various costs and the sampling errors ap- 
plicable to these costs. Methods of probability sampling and estimating are 
described which reduce the amount of work required, facilitate processing, and 
speed up the release of significant cost data to management. 


HISTORY 


The present technique of work sampling, also known as ratio delay study, 
seems to have had its origin in the early 1930’s when L. H. C. Tippett [7], the 
English statistician, applied it to machines and machine operators in a textile 
mill. Tippett used what he called “A Snap Reading Method” to obtain at 
random intervals observations about machine operators for the purpose of 
estimating the per cent of time machines were idle, the percentage composition 
of reasons for delay or down time, per cent of time workers spent on each of 
their duties and the percentage distribution of production so as to enable over- 
head cost to be distributed among several products. 

Tippett mentions four merits of work sampling: (1) it costs much less than 
the usual production study, (2) the method is more acceptable to workers and 





* This paper was prepared and the project described herein was carried out under the general supervision of 
E. J. Engquist, Jr., Director of the Statistics Division. 
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labor unions because no stop watches are used, as in time and motion study, 
(3) better data may be obtained because, while workers may change their ac- 
tivities when they discover a production study is being made, they cannot do 
this so easily when work sampling is used, and (4) when work sampling is used, 
major sources of variation, as well as random sampling error, can be isolated. 

Tippett made tests to determine whether or not the percentages derived by 
the sampling method were distributed as the binomial distribution. He found 
that this was true in some textile factory operations but warned that due to 
differences which might arise between time, machines, and operators the actual 
sampling error could exceed that given by the formula for the standard devia- 
tion of a proportion distributed as a binomial distribution where the proportion 
P is constant: c=+/PQ/n. He pointed out that relatively large samples were 
required—usually 500 or more. 

Some years later Morrow applied Tippett’s method to a number of industries 
and in general came to the same conclusions that Tippett did [5]. Morrow 
called this method “ratio delay study.” He pointed out that the method can be 
used to estimate percentage of time consumed in delay, percentage of time spent 
on various machine and nonmachine operations, and percentage of time ma- 
chines were working and not working. He pointed out also that the observa- 
tions should be distributed over all hours of the day and week, that the method 
provides a way of observing and evaluating operations of the department as a 
whole, and that the obse:vations may be interrupted at any time without 
affecting the study. 

Waddell, editor of Factory Management and Maintenance, introduced the 
term “work sampling” in order to emphasize the fact that Tippett’s technique 
need not be limited to the study of factors of delay but could be applied to the 
study of all work activities [3]. 

Brisley described how the method of random work sampling can be illus- 
trated and validated by drawing random time samples from a moving picture 
of a person at work. The estimated percentages of time derived from these 
samples were compared with true percentages calculated from the film [3]. 
He applied the method of work sampling to nurses, professional engineers, and 
executives for the purpose of determining among other things how much time 
was spent on regular duties and how much on duties someone else should per- 
form. 

In 1952 Abruzzi reviewed the problem of delay factors and described how the 
quality control chart can be used in connection with Tippett’s method in order 
to establish whether or not the pattern of delay is stable over time[1]. He re- 
ported that at one plant there were no significant differences in the delay per- 
centages within days and between days where the period covered was 7 days. 
This finding is similar to that reported by Tippett relative to delay percentages 
found in certain cotton mill operations in England. 

Barnes [2] compiled a wide variety of papers illustrating the applications of 
work sampling to the problems of industry. He described how the technique 
can be used not only to estimate relative frequency of idle status, sources of 
delay, and components of work operations but also to obtain data which can 
form the basis for wage incentives, work measurement, and performance rating. 
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A review of the foregoing papers and other papers in this field reveals a 
number of important points. In many instances detailed information is lacking 
as to how randomness is obtained in sampling machines and operators. No 
distinction is drawn between an individual sampling unit and a group or cluster 
sampling unit, with the result that inappropriate formulas are sometimes used 
to estimate sampling errors and sample sizes. Information is lacking relative to 
the most efficient sample design to use in any given situation. There is little or 
no information relative to stability of percentages over long periods of time, 
such as several months. This latter aspect is related to the problem of how best 
to distribute the sample over time. For example, in one case about 2300 observa- 
tions were taken in 3 days, and 5000 observations in 5 days. If the variation be- 
tween weeks is large compared with the variation within days, then the selec- 
tion of large samples each day for a few days is inefficient because it not only 
eliminates week-to-week variation but it may give an inadequate measure of 
variation between days—types of variation which should appear in the esti- 
mates. It may be better to take fewer samples per day and include more days 
and weeks. 

This review also shows that random work sampling may be preferred to 
other methods of cost analysis such as production studies, ticket systems, and 
test runs where each is used for the same purpose. In the production study 
continuous daily records are kept for each worker and machine showing each 
activity and the time spent on it. This record keeping is time consuming, costly, 
may be done carelessly, and may be biased if limited to short lengths of time. 
Furthermore, as Tippett pointed out, workers may deliberately change their 
work habits while such a study is in progress thereby introducing bias into the 
data. Morrow [5] reported that work sampling required only about one third 
of the time required to obtain similar data by the usual production studies. 
Barnes [2] cited a study where the time required for a work sample was less 
than 5 per cent of that required by the usual time study, yet the data were as 
good. 

Most of the applications referred to above were made in connection with 
factory operations, and for the purpose of estimating percentage of work time 
lost due to major sources of delay or trouble. That the technique can be applied 
to other problems and to nonfactory work, is illustrated by the example 
described below. 


THE ORGANIZATION AND THE PROBLEM 


Random work sampling is being applied to the work of the Statistics Di- 
vision in the Internal Revenue Service for the purpose of cost analysis and con- 
trol. This Division consists of about 350 employees divided into two main 
branches—one devoted to planning and the other to operations. There are 
about 50 employees in planning and administration; the remainder are in 
operations. The Division is divided into a total of 24 different sections and 
offices. 

The Division is a service agency providing professional service relative to the 
planning and approval of projects of a statistical nature, as well as providing 
large scale tabulation and computation services. This means that a wide variety 
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of jobs is required including those in the clerical, professional, and administra- 
tive fields. There are messengers, mail clerks, coding clerks, key punch opera- 
tors, tabulation machine operators, typists, statistical clerks, and secretaries. 
There are electronic machine programmers, analytical statisticians, mathe- 
matical statisticians, and economists. There are supervisors, chiefs of sections 
and branches, and administrative and executive officers. 

It was planned to collect several kinds of data: 


(1) time and cost associated with major functions, such as program and 
analysis; 

(2) time and cost associated with major programs, such as management in- 
formation reports; 

(3) time and cost associated with special services which are requested from 
outside the Division; and 

(4) time spent by a section, an office, and the Division, on nonwork activi- 
ties (a) due to production factors such as no work assigned, or (b) due to 
other factors such as leave, coffee breaks, and personal needs. 


These data would make it possible for management to do the following: 


(1) plan a sound and realistic budget for the entire Division; 

(2) determine month-to-month fluctuations in costs; 

(3) determine the additional resources that would be required by an expan- 
sion of function or program; 

(4) determine the resources that would be released by restricting a function 
or program; and 

(5) isolate weak and wasteful spots in the organization. 


With these data available management would be in a better position to plan, 
to appraise, and to take remedial action. It was not intended that the data 
obtained would be used to take action relative to individuals since the data 
records were to be anonymous, but instead the data obtained for a section or 
office would be used by the supervisor only on a group basis. 


THE SAMPLE DESIGN 


The test run. Once the use of work sampling in principle was approved by the 
director, it was necessary to develop a sample design and a method of collection 
which would be feasible under the various limiting conditions which had to be 
met. 

In view of the almost complete absence of knowledge relative to the charac- 
teristics which were to be estimated, and the kind of data collecting system 
which would work, a pilot or test run was made during one week of five con- 
secutive days in June 1955. 

During that week eight random time samples were taken daily of the work 
of every person at work in the Division. Random times were called by units or 
offices, so that the unit of sampling was a cluster of persons [4]. The time unit 
randomly selected was the minute, and a table of random numbers [6] was 
used to select the sample minutes from a total of 480 minutes for those units or 
offices where the lunch hour was excluded from the sample, and 510 minutes for 
those units or offices where the lunch hour was included in the sample. 








386 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1958 


The materials prepared and distributed for this test run included data 
record sheets, an instruction booklet, and a coded list of projects. Verbal in- 
structions were given to all employees prior to the start of the test run. 

Five higher grade clerks were used part time in order to inform units or 
offices of the random times and to collect and review data records. This latter 
step was necessary to insure proper understanding of instructions and to ac- 
count for all employees. 

Results of the test run. The results from the test run were so important that 
they more than justified the extra work involved. The major findings, which 
were incorporated into the final plan, were as follows: 


1. 8 random times daily were unnecessary; no more than one per day was 
required. Table 386 illustrates the homogeneity of an activity since a chi- 
square test shows no difference between morning and afternoon. Hence 
in a case like this, it is unnecessary to sample both morning and afternoon, 

. Variability was much greater among personnel in the Program Branch 
than among those in the Operations Branch, so different rates of sampling 
were called for. 

. More detailed instructions to employees were cailed for. Consequently, 
an enlarged booklet of instructions was prepared including code numbers 
to be applied to activities not covered by the list of projects. In addition, 
employees were to be given the code number of every job to which they 
were assigned. 

. Every supervisor was asked to review the data sheets for his unit to in- 
sure that (1) one data sheet was obtained from every person for every 
time called, and (2) each sheet contained no gross errors such as wrong 
salary grade, wrong organizational unit, or wrong type of activity. 


TABLE 386 


FREQUENCY OF OCCURRENCE OF ACTIVITY A IN RANDOM TIME 
SAMPLES COVERING 5 SUCCESSIVE DAYS 








Organizational 
Unit 








Total 158 





Work time division 50% 50% 











Expected frequency 172 172 








(158 — 172)? + (186 — 172)? 
172 172 
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THE FINAL SAMPLE DESIGN 


As a result of the test run, the sample was re-designed so as to give manage- 
ment adequate information on a timely basis. This sample was designed to give 
monthly estimates of percentages and costs for broad types of activities and 
major projects, for the Division as a whole. It was not designed to give adequate 
data for specific projects nor for small administrative units on a monthly basis. 
However, over a period of several months adequate estimates could be ob- 
tained for major projects and activities for each of the 24 administrative units. 

The total number of clusters drawn at random weékly is 3X5 units+2 X19 
units, or 53. This means 212 clusters every 4 weeks or 2,756 clusters during a 
year. These sample sizes of 212 and 2,756 are sufficient in view of the stability 
of the characteristics being estimated and the sampling errors which can be 
tolerated for management purposes. 

Two strata were introduced: the Program Branch with about 50 employees 
and the Operations Branch with about 300 employees. Three random observa- 
tions are taken every five days for the first group, and two random observations 
are taken every five days for the second group. This plan gives about 3,000 
observations every four weeks for the entire Division. (3X450+2X4X300 
=3,000.) 

Stratification is used because of the greater variability of the activities in the 
Program Branch. In this latter group which includes the higher level profes- 
sional and managerial personnel, the projects and activities engaged in vary 
greatly. On the other hand in the Operating Branch large numbers of employ- 
ees may be engaged on one project for weeks at a time. 

The random times are assigned at random to the days within each five day 
period, and only one random time is selected per day for any administrative 
unit. In other words the two or three random times cannot occur during one 
day—only one random time is assigned to any one day. 

The rate of sampling for the first group is three divided by the total number 
of minutes in 5 days, or 


3 1 


510X5 850 


The rate of sampling for the second group is two divided by the total number 
of minutes in 5 days or 





2 1 
2550 1275 


The total number of minutes in a day is 480+30 or 510, since the lunch hour is 
included in the sample. Since the sample unit is a minute, we can think of the 
population as finite and express the sampling rates as given above. 

As a result of the test run, the lunch hour of every employee is included in 
the time subject to random sampling. It is easier to do this since not all em- 
ployees have fixed lunch hours while all of those who do have fixed lunch hours 
do not have them at the same time. Observations which fall in the lunch hour 


4 
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can be excluded from the tabulations, or if desired, they can be used to estimate 
whether employees in the Division as a whole are taking more than the allotted 
time. 

For each random time every person in the organizational unit fills out a data 
sheet. Hence the sampling unit subject to random selection is a cluster of persons, 
not an individual. This means that while the formula for the sampling error of a 
proportion derived from a binomial distribution ~/PQ/n is applicable to the 
repeated sampling of individual sampling units, where a person is engaged in 
activity A or not, it is not applicable to group or cluster sampling units. The 
appropriate expression for the sampling error for the latter type of sampling is 
described and illustrated below. 

Minutes rather than individuals are subject to random selection. All em- 
ployees are included in the sample and are selected in 24 groups or clusters 
corresponding to the administrative units or sections into which the division is 
divided. Random times are selected from a list of three-digit random numbers. 
Random numbers from 000 through 509 are included to correspond to a work 
day beginning at 8:15 a.m. and ending at 4:45 p.m., 510 minutes later. Random 
numbers from 000 through 509 are added to 815 to obtain random times. For 
example, a random number of 369 corresponds to 8:15 plus 6 hours and 9 
minutes or 2:24 p.m. These random times are assigned to the 24 units so that 
each of the latter is sampled at different times, although of course several of 
them may be sampled on the same day. 

Random times are selected by the Sampling and Estimating Methods Sec- 
tion and turned over to the administrative office of the Division, and are tele- 
phoned to the supervisor or secretary of each unit as they occur. This is done 
as soon after the random time occurs as it is feasible to do so—usually within a 
minute or two. This call is a signal for every person in that unit to fill out a data 
record showing among other things what project he or she was working on at 
the time called. The supervisor or secretary is charged with the responsibility 
of seeing that a data record is filled out for every employee in the un:t regardless 
of whether present or absent; and of reviewing data sheets to insure that they 
are free of gross errors. 


THE DATA COLLECTING PROCEDURE 


The data collecting procedure can best be explained in connection with a 
booklet giving instructions to employees, a list of projects by name and code 
number, and the data sheet, a copy of which is reproduced here. This coded 
list of projects and activities, and the data sheet which usually required only a 
few seconds to complete, combined with anonymous self reporting, constitute 
the major features of the data recording and collecting procedure. 

Seven items of information are required on each data sheet. The first is the 
date and the second is the random time telephoned to the unit by the ad- 
ministrative office. The third is the salary grade, which can be checked on a 
list. The fourth is the project number, which is obtained from either coded list 
of projects or from the instruction booklet for employees if the activity is not 
covered by the project list. The fifth item is to determine whether the em- 
ployee lacks a work assignment. The sixth item is to determine whether the 
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DISTRIBUTION OF COST SURVEY—DATA SHEET 


Instructions: To determine the costs of the various projects engaged in by the Statistics 
Division, you will be asked periodically to complete the check sheet below showing how 
your time is being spent. DO NOT SIGN the sheet, but be as accurate as possible. This is 
not a measurement of your individual performance, but rather an attempt to determine 
more accurately the costs of the services rendered by the division. 





5. CHECK (,/) YOUR SPECIFIC 
1. DATE: ORGANIZATION: 

(1) (J Office of Director 

(2) () Administrative Office 





TIME: 








. CHECK (,/) YOUR GRADE: i PROGRAM BRANCH 
GSs-1 0 GS-9 9 (3) (J Office of Chiei 
GS-2 1] 18-10 [1] (4) () SOI Section 
GS-3 [1 GS-11 [1 (5) [) Management Stat. Sec. 
GSs-4 (J GS-12 [1 (6) () Operation Reports Sec. 
GS-5 0 GS-13 1 (7) (SEM Section 
Gs-6 O GS-14 4 
GS-7 O GS-15 [J OPERATIONS BRANCH 
18-8 [1] (8) () Office of Chief 

(9) () Stat. Tech. Lab. Sec. 





. ENTER PROJECT NO. 
CHECK (,/) IF NO WORK 
ASSIGNED [7]. 


Presentations Section 
(10) (j Office of Chief 
(11) (J Graphies Unit 
(12) () Typing and Prf. Unit 





. CHECK (\/) APPROPRIATE DE- 
SCRIPTIONS OF WHAT YOU WERE | = (13) [5 Office of Chief 


JUST DOING: (14) [ Coding Unit 

(1) () Performing assigned subject (15) ( Stat. Auditing Unit 
matter duties (Typing, coding, (16) (1) SOI Comp. and An. Unit 
punching, supervising, writing, (17) (J Prod. Stat. Comp. and An. Unit 


editing, planning, etc.) (18) ( Returns and Ree. Files Unit 
(] Performing related duties (Re- 


ceiving instructions, laying out Mechanical Operations Section 
work, getting supplies, sharpening (19) [J Office of Chief 
pencils, etc.) (20) (J Project Planning Unit 
(3) (J Waiting (For work, instructions, (21) () Transcribing Unit 
etc.) (22) [) Compiling Unit 
(4) (j Other than (1), (2) or (3) (23) (0 Punch Card File Unit 


(5) [J On leave (24) [) Mach. Res. Rev. Unit 
(6) (J At lunch 


Manual Operations Section 








U. S. Treasury Department-Internal Revenue Service Form M2079 
(Rev. 6-20-55) 


employee is in work status or not; if in work status, whether working on 
assigned duties or related duties, or waiting for work; and if not in work status, 
whether on leave or at lunch. The seventh item identifies to which of the 24 
organizational units the employee is assigned. 

The data sheets are verified to detect any gross errors such as wrong salary 


grade, wrong section, or an insufficient number of data records for a given 
section. 
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Does anonymous self reporting result in accurate data? Or conversely, does 
this method lead to distortion of the true work situation? Our experience cover- 
ing a period of two years leads to the conclusion that if there is any falsification 
of the true work situation, it is so infrequent as to be negligible. The evidence 
for this statement is as follows: 


1. Data records are anonymous. 

2. No administrative use is made of individual data. 

3. The random work sample has become routine since it has been carried on 
now for two years (November 1957). 

4. Units which show certain high percentages, such as per cent time waiting 
for work, are those for which observation leads to the same conclusion. 


THE METHOD OF ESTIMATION 


The mechanies of deriving costs from the sample data required the develop- 
ment of a set of cost factors. These cost factors required the use of base pay 
before deductions for each grade level inasmuch as the budget with which the 
cost estimates were to be related was prepared on a similar basis. The assigning 
of time and cost values to each of the sample observations could then proceed 
on a straight line basis since the time period covered, the base salary paid during 
the period covered by each individual set of observations, and the number of 
observations made were known. By dividing the total time and cost by the 
number of sample observations a unit value of time and cost for each sample 
observation was obtained. This unit value is the same for every person with the 
same base salary and the same sampling rate. 

For example, if the period of time covered was 5 days and the salary cost for 
5 days was $100 and the number of sample observations during the 5 days was 
3, then the values te be assigned to each sample observation was 5/3 days for 
time and $100/3 for cost. These values were introduced into a punched card 
which was prepared for each observation. These values were gang punched into 
each card for each person with the same base pay and the same sampling rate. 
By simple summation, the cost in time or money could then be obtained for 
any of the attributes appearing in the card. 


ADVANTAGES OVER OTHER METHODS 


The foregoing application of work sampling techniques includes a number of 
procedures which are an advantage over other methods. 


1. The collection of salary data on the data record and the use of the method 
of estimation inherent in the sample design make it possible to estimate 
salary costs and their proper sampling errors. This procedure avoids the 
loss of information due to the use of a method of estimation divorced 
from the sample design. 

. The telephoning of random numbers, the use of anonymous self reportig, 
the use of an abbreviated data sheet, the use of codes for reporting activi- 
ties, and the use of a simplified method of estimation all reduce drastically 
the cost of implementing the work sampling method without sacrificing 
the quality of the data obtained. 
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3. By making every minute subject to random sampling, and by including 
every person, it is possible to obtain estimates of time and cost which 
apply to the entire Division, and which therefore are applicable to budget 
planning and other problems involving the entire Division. Sampling 
some of the employees all of the time, or all the employees some of the 
time, or some of the employees some of the time, would not provide the 
information required. 

. Arandom time is called a minute or so after it actually occurs so that the 
sampling has no effect on what the person was doing. Furthermore, with 
the telphoning of random times, no one gets any advance cues, as is likely 
to be the case when a person makes a tour of an area in order to observe 
and record what employees are doing. 

5. Continuous long range sampling means that variations between days 
within weeks, between weeks within months, and between months within 
a year, are given their proper weights in the estimates. This cannot be 
done where studies are limited to a few days, or even to a few weeks. 

j. Sampling error formulas appropriate for cluster sampling are employed. 
Not all work sampling models are binomial requiring the use of PQ/n to 
estimate sampling variance. 


EXAMPLES OF ESTIMATES AND THEIR SAMPLING ERRORS 


The following examples illustrate methods of estimating proportions and 
aggregates and their sampling errors, using formulas described in the technical 
appendix. These examples are highly simplified for presentation purposes; in 
practice estimates would not be made from such small samples. 

If the size of the random sample (number of clusters or number of individuals 
selected at random) is large enough, then we are warranted in assuming that 
the estimate (proportion or aggregate) is distributed approximately as the 
normal curve. Under these conditions we can calculate the confidence limits— 
estimate +ks—where k is the normal deviate corresponding to the confidence 
level (1.96 for 95 per cent level), and s is square root of the sampling variance 
of the estimate derived from the sample. 

1. Assume the following array of activity A where 1 means presence of A 
and 0 its absence: 











M (time instants) Sum 
(time 
instants) 














Sum Y; 1 
P; 1/4 2/4 
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Since M=5 and N =4 the population proportion, equation (2) appendix, is 
P = }(1/4 + 1/4 + 2/4 + 2/4) = 
Also 


Select two times at random, say the second and fourth time intervals, so that 
m=2. Then an estimate of P =.40, equation (3), appendix, is 


p = 4(1/4 + 2/4) = 375 


also since N =4 and m=2 


N m 


p= > dw 


a 4X2 


2. The sampling error of the foregoing estimate of 0.375 is calculated as fol- 
lows equation (5) appendix: where M =5, m=2, and p= or .375; 
(5 — 2) x“ (1/4 — 3/8)? + (2/4 — 3/8)? 

5 2(2 — 1) 


3? = 








s = 0.096. 


3. Estimate the proportion and its sampling error if time is sampled at ran- 
dom, and we center attention on a particular person. Assume M =20X480 and 
N =50. 

Let m=20 (once daily). Assume 16 data sheets show activity A. 

Then 


16 _ 0.80 
diate Tite 4 


and from equation 13 appendix: 
© (20 X 480 — 20) X .80 X .20 
20 x 480 x 19 





s = .09. 


4. Assume random sampling of M only where M=2000 and m=10. The 
factor (M—m)/M can be ignored. Assume that the 10 sample values of p; are: 


09 -25 
15 .08 
12 -16 
.20 18 
ll 12 
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The average value 


1.46 0.146 
p= 10 = VU. ° 


Then from equation 5 appendix the sampling error of this estimate is 


10 


> (p; — 0.146)? 
1 


10 x 9 


3? = 





s = 0.017. 


5. Assume the following array of unit dollar costs for activity A for 8 per- 
sons. Let M=10X480 and m=10. 








m random times (minutes) 











Total 2; 











The estimated total cost is, equation (16) appendix, 


xs: 10 X 480 
Total cost = — > 2; = — r xX .51 = $244.80. 
m 1 


The sampling error of this estimate, equation (19) appendix, where 


51 
z= — = .05l, 
10 


48002 10 
>> (a; — .051)?. 
1 


es = 


~~ 10x9 


The value of s is $34.64. 
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TECHNICAL APPENDIX 


The activity array. Consider an activity array population of MN cells con- 
structed from M time units and N persons, in which each cell is identified with 
some activity which may differ from unit to unit and from person to person. 





Time Units 








3 





Ai 
Ax 
Au 





Am Aw? Ams Ay; Ayu 





Let yy =1 if Ag is the activity under study, and zero otherwise. Then the 
proportion of time spent on a particular activity in the jth cluster (time period) 
is 


(1) 


The proportion of time in the population spent on a particular activity over the 
total period of time is 


(2) 

If a random sample of m clusters (time periods) is selected from a total of M 

clusters (time periods) and if all N are included, then an unbiased estimate of 
P the population proportion is 


(3) 


where p, is the proportion in the jth sampled cluster. 
The true variance, V(p), of the estimate p is given by the following expres- 
sion: 


M 
L (P; — P)? 


—-M7! Jai 





ee” ee 
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The unbiased estimate of V(p), is given by v(p) where 


al 2 
we en” 


M m(m — 1) 





»(p) = (5) 
The factor (4/—m)/M can be ignored if m is very small (less than 5%) com- 
pared with M. 

For stratified random sampling the proportion of time spent on a particular 
activity over the whole period of time for al! k strata in the population is as 
follows: : 


1 k 
Pu, = — > N3P; (6) 
N j=l 
where N; is the number of elements in each cluster in the jth stratum, 
N= >} Nj, and P,’ is the proportion of time spent on a particular activity 
over the total period of time in the jth stratum. 
An unbiased estimate of P,, is 
1 k 
a — ZL N 3p,’ 
Ny 
where p,;’ is an unbiased estimate of P;’, and is obtainable from (3). 
The true variance, 


k 
DL N?V(p;') 
r st j=l 
V (pat) ae (8) 
where V(p;’) is the true variance of the proportion in the jth stratum, and is 
given by (4). 
An unbiased estimate of V (p,,) is given by v(p,.:) where 


k 
Dd N;v(p;') 
OA gps) se Sse (9) 
and v(p;’) can be obtained from equation (5) above. 

If estimates of the proportion of time spent by an individual on a particular 
activity are also of interest, then the following formulas are appropriate. Note 
that we are still sampling clusters; however in this case since interest is cen- 
tered in an individual, each cluster will have but two values, 1 or 0, depending 
on whether individual 7, say, is working on a particular activity or not. 

The proportion of time individual 7 spends on a particular activity over the 
total period of time is 


1 


= M > Y 4. (10) 


P; 
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An unbiased estimate of P; is p; where 
l m 
p= — Do vis (11) 
™M jut 


and y;; is the sample value of Y,;. 
The true variance V (p,) of the estimate p, is 
M-—m P Q; 


V(ps) = os 


where Q; = 1 — P,. (12) 





An unbiased estimate of V(p,) is given by v(p;) where 
M-m pd 
M m-—1 
The unit cost array. A logical counterpart of the activity array is the unit 
cost array. In this array there are MN cells the same as in the activity array, 
but each cell entry is the salary cost (C;;) assignable to a particular person and 
a specified time unit. 





v(p;) = and qi = 1 — pi. (13) 


THE UNIT COST ARRAY 














Time Units 
Individual 
1 2 3 eee ff cae) 
1 Cu Ci Cis Cy Cim 
2 Cu Cx Cos C2; Com 
3 Ca Cx Cx C3; Csm 
u Cu Ciz Cis Cu Cim 
N Cm Cya Cys Cy; Cy 








Let C,; represent the cost per unit of time associated with individual 7 at 
time j. Let X;;=C,; if activity A. is under consideration, but equal to zero 
otherwise. 

Then 

N 
Xi; = Zz Xj (14) 
t=1 
gives the aggregate salary cost in the jth cluster (column) for a particular 
activity. Also 
M 
x=) X; (15) 
j=l 
gives the aggregate salary cost for the particular activity for the total period of 
time. 
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If a random sample of m clusters is selected from a total of M clusters and 
if all N are included, then 
M m 
x’ =— Di 2x; = Mi 
™M j=l (16) 
is an unbiased estimate of the population aggregate X, where z; is the sample 
value of X; and where 


ET 


Ri 
ll 





m 
The true variance, V(x’), of the estimated aggregate x’ is 
M(M —m) 
Vee) = ——— 2 G@;, — 2) (17) 
m(M — 1) y 
where X is 


Dd X; 


1 
M 
An unbiased estimate of V(x’) is given by v(x’) where 
M(M —™m) = 
——— ) (z; — ®)*. (18) 
m(m — 1) ja 


If the sample m is small prey eo with M then this expression becomes: 





v(z’) = 





v(x’) = _ eats (x; — #)*. (19) 


m(m Kp m(m — 1) j=l 


If stratified random sampling is used we apply equation (16) to each stratum 
and add the k values of x’ in order to obtain the estimated total aggregate. Simi- 
larly we apply equation (18) to every stratum and add the k values of v(x’) in 
order to obtain the total sampling variance of the estimated total aggregate. 
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THE TRENTILE DEVIATION METHOD OF 
WEATHER FORECAST EVALUATION* 


Morris JaAmMEs SLONIM 
Headquarters, United States Air Force 


This article outlines a method originated by the author during World 
War II for evaluating short range weather forecasts. An interesting 
feature is the use of “deviational penalty scores” to adjust for the rela- 
tive ease or difficulty of forecasting for varying weather situations. A 
number of other studies on weather forecast evaluation by other authors 
lead to different methods of analysis (see references at erd of article). 
The author has encountered situations in other fields where variations 
of the method proved useful. As an instance, some elements of this 
method were used in designing a scoring procedure for rating bombing 
proficiency of air crews during the Korean War. 


DEVELOPMENT OF THE SCORING METHOD 


Nn order to evaluate the accuracy of weather forecasts, it is necessary, first, to 

take account of the degree of difference between forecast and observed val- 
ues. Thus, generally speaking, a person forecasting atmospheric pressure within 
one millibar of the observed value should get a better score for that particular 
forecast than a person whose prediction was off by ten millibars. In addition to 
taking account of the degree of difference, however, it is essential that account 
be taken of the relative difficulty or ease of forecasting various values of a given 
meteorological element. For example, it is much easier to forecast correctly 
“no rain” for the six-hour period preceding 1230 Z-time! during April for Jack- 
sonville, Florida, than it would be to forecast correctly .01 inch of rain for the 
same place for the same time. The point is illustrated by the following tabula- 
tion which shows the actual distribution of precipitation in Jacksonville for the 
six-hour period 0630 Z to 1230 Z for April over a six-year period: 














TABLE 398 
Precipitation A ts 
ce aa Number of Observations 
No Rain 147 
Trace 15 
01 6 
.02-.15 6 
-16 or more 6 
Total 180 








It can be observed from the above sample distribution that a person fore- 
casting “no rain” has a chance of 147 in 180 of predicting correctly; whereas 
* The author is particularly indebted to Lewis T. Reinwald of the Corporation for Economic & Industrial 


Research for suggestions made for this article. 
1 Z-time stands for Greenwich Mean Time. 
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the chance that a prediction of .01 inch of rain will be correct is only 6 in 180. 
It would seem logical that two persons correctly forecasting these two precipi- 
tation values from different six-hourly synoptic maps should not be given the 
same score. 

In order to take into account the degree of difference between forecast and 
observed, and at the same time to take account of the relative difficulty or 
ease of forecasting any meteorological element in different parts of the range, 
the following approach was taken. 

Denote the observed values of a specified variable meteorological element 
(e.g., precipitation, temperature, atmospheric pressure) by X with probability 
density function f(X). Let Y be the predicted value corresponding to each ob- 
served X. If we divide the frequency distribution of X into m intervals so that 
f(X) for each interval is 1/m we have transformed our original distribution to a 
uniform distribution. In the simplest situation we will have different values of 
X in each of the m equal probability intervals. In such a situation, these inter- 
vals I,, I, Is, - - - , Im can be assigned the successive location values of 1, 2, 
3, +--+, m. Then, any given forecast (Y) as well as the actual observed value 
(X) can be assigned the corresponding location values mx and my and the 
score S(X, ¥) =|mx—my]. Each distribution was divided into 30 equal proba- 
bility intervals (or trentiles).2 The number 30 was selected rather arbitrarily, 
and represents an attempt to secure a spread in scores while keeping the num- 
ber of intervals small enough to be manageable for machine processing. It de- 
veloped also that 30 intervals was about the maximum for a legible forecast 
form (See Figure 398). 

This method of scoring automatically takes account of the degree of differ- 
ence between forecast and observed values and thereby assigns a perfect score 
(zero) to a “hit” and a score of 1, 2, - - - , m—1 to a “miss,” with the highest 
scores assigned to the widest “misses.” More particularly, the scoring method 
takes account of the relative ease or difficulty of predicting a specific value ac- 
curately by converting the climatological data for a specified element into 
intervals of equal probability.® 

Following is an example of a table of trentiles for a distribution of atmos- 
pheric pressure values. 

The individual forecaster’s score on his prediction of atmospheric pressure 
for 0030 Z time for the week of April 5—April 12 for Seattle would be obtained 
by converting the difference between forecast and observed values into tren- 
tiles in the table and getting their difference. Let us assume a forecaster 
predicted a pressure of 1011.8 millibars for 0030 Z for Seattle for April 7th. 
From the table it can be seen that if the observed value let us say, 
turned out to be 1017.6 millibars, the forecaster’s score for this portion of his 
forecast would be 10 (i.e., 15—5). 





? For the sake of euphony and brevity the author took the liberty of using the French rathor than the more 
conventional Latin prefix in forming the word “trentile.” 

8 Frequency distributions of the values of each of the five elements forecast were obtained from observations 
recorded for previous years for each station at each of the Z-times for the 30-day period centering about the week 
in question. The five elements selected for the verification system were atmospheric pressure, temperature. ceiling 
height, visibility, and precipitation. 
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TABLE 400 
Atmospheric Pressure Values 
Trentile Number (in Millibars) 

1 less than & including 1004.4 
2 1004.5 to 1007.8 
3 1007 .9 to 1009.5 
4 1009.6 to 1010.5 
5 1010.6 to 1012.2 
6 1012.3 to 1013.2 
7 1013.3 to 1013.5 
8 1013.6 to 1014.2 
9 1014.3 to 1014.9 
10 1015.0 to 1015.2 
11 1015.3 to 1015.6 
12 1015.7 to 1015.9 
13 1016.0 to 1016.9 
14 1017.0 to 1017.3 
15 1017.4 to 1018.0 
16 1018.1 to 1018.3 
17 1018.4 to 1018.6 
18 1018.7 to 1019.0 
19 1019.1 to 1019.7 
20 1019.8 to 1020.3 
21 1020.4 to 1021.0 
22 1021.1 to 1021.3 
23 1021.4 to 1022.4 
24 1022.5 te 1023.4 
25 1023.5 to 1024.0 
26 1024.1 to 1024.4 
27 1024.5 to 1025.4 
1 
1 


28 1025.5 to 1027. 
29 1027 .2 to 1029. 
| 1029.2 and over 





The above trentile distribution for pressure was a simple example, purposely 
selected to illustrate the general method employed. It will be noticed that no 
single pressure value falls in more than one of the trentiles. To illustrate the 
procedure that should be followed when a single value of an element, because 
of its frequency of occurrence, occupies more than one trentile, the following 
table of trentiles was selected. 

It is apparent that a method is needed that will yield different trentile devi- 
ations between predicted and observed for correct forecasts of values occupy- 
ing more than one trentile (such as “no rain” or “trace” in Table 401, above) 
than it will yield for correct forecasts of values that are included in only one 
trentile (such as .01 inch). Thus, for a correct forecast of .01 inch of rain (24th 
trentile), the deviational score is zero (24 minus 24). If, however, a person fore- 
casts “no rain” and “no rain” is observed, it would obviously be unfair to give 
him a perfect score of zero, inasmuch as the likelihood of making a correct ran- 
dom forecast of .01 inch was only 1 in 30, as against 20 in 30 in the case of pre- 
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TABLE 401 


401 








Trentile Number 


Amounts of Precipitation 
(in inches) 





CoOnNOarh WN 


No Rain 





Trace 





-O1 





.02 





.03—.04 





-05-.08 





.09-.13 





.14-.21 





30 





.22 and over 





dicting “no rain” correctly. The method used to differentiate between two such 
correct forecasts in assigning scores was termed the method of “deviational 
penalties” and is based upon the following reasoning: 

Assume, for purposes of illustration, that precipitation forecasts and observa- 
tions can be measured within microscopic limits so that the value “no rain” 
can be broken into as many minute class intervals as there are trentiles in the 
“no rain” category. If “no rain” occupies 2 trentiles the following possibilities 


exist: 
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Forecast Value May Observed Value May 


Fall in Trentile: Fall in Trentile: Trentile Deviation 








1 
2 
1 
2 








Total Deviation 





The average deviation is 3: i.e., 





Sum of Deviations 2 
A 
= 


No. of Deviations 


The formula for the general case, where the number of trentiles embracing a 
single value of the element is N can be derived in the following way: 


D= Deviational Penalty 
N=Number of trentiles occupied by a single value of meteorological ele- 
ment. 
n;= Predicted trentile 
n; = Observed trentile 
N*= Total number of possible different pairs of forecast-and-observed values. 


N N 


> > | mi = n; | 
D = — Po (1) 





p15 | =25 D5 m-n) 


i=l j=l 
since 

|IN-1, +|N-2| +---+]2-N| +---+]1-N| 4+--- 
mie = 1): tn — 2 ts + (- 2 — FB) + >: + + ($e — ) + --- 
= 2(N —1)+2(N —2)+--:- 


Moreover, 


N 


> + ta, — nj) = 2 S| ne - i) 


t=1 jel 2 


né ny 
Yn =n > 1 =n? 


jul j=l 
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and 
> aoa n(n; + 1) 
inl 2 
Simplifying, (3) becomes 
z ng N(N+1)(2N+1) 3N(N+1) N(N?—-1) 
=O held 7} # 6 Tee 





_ N(N?-1)_ Nt=1 
ae eta 


This value, (V?—1)/3N is the “deviational penalty” trentile score assigned 
to a forecaster who correctly predicts a value of an element that occupies more 
than one trentile. The penalty score is based on the reasoning that if forecasts 
and observations were made within extremely fine limits the forecaster’s aver- 
age score over all possible different random forecast-and-observed situations 
within a group of trentiles including a single value would be (N?—1)/3N. In 
the precipitation distribution of Table 401 above, for example, a person fore- 
casting “no rain” would, if “no rain” were observed, get a score of 6.6, since “no 
rain” occupies 20 trentiles (i.e. N =20). 

Another situation that occurs frequently and which requires separate scoring 
treatment is one in which the forecast value differs from the observed value 
and either the forecast or observed values, or both, occupy more than one tren- 
tile. Let us examine, for example, the following distribution of temperature: 


TABLE 403 


D (4) 











Trentile No. Temp. in °F 
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The formula for this type of scoring situation can be derived as follows: 
Let 

T =Trentile score 

N,= Number of trentiles occupies by the observed value 

n;= Number of ith observed trentile 

N:= Number of trentiles occupied by the forecast value 

n;=jth forecast trentile 

K =Number of trentiles between forecast and observed ranges 
N,N:2=Total number of possible different pairs of forecast-and-oberved 





OBSERVED FORECAST 
VALUE VALUE 






































N, Trentiles K Trentiles Nz Trentiles 


Ne Ni 


D> Dd (M+ K +n) - 2 


fol snl 





NiN2 


Ni Ni(Ni + 1 
> [ viv, +K+n,) - eee) / NN: 


jel 





(2) 


NN2(N. 1 NiNAN 1 
1 ot ge 1 wot | / iN 


~ Baer + K) + 


Ni+ Ne 
2 

(The forecast and observed trentile ranges may of course be contiguous in 
which case K equals zero). 

Referring to Table 403 above, if a temperature forecast is 49°F and the ob- 
served is 43°F, the trentile score is 9, since N,=5, N2=3, and K=5. 

The use of simple differences in trentile values when both forecast and ob- 
served fall into single trentiles; the use of (V?—1)/3N when a correct forecast 
value occupies more than one trentile; and the application of 


Nit N. 
egayte 


for incorrect forecasts where forecast or predicted values or both occupy more 
than one trentile, together permit the scoring of all possible combinations of 
forecasts and observations. 

The trentile deviation score was intended as a skill score and the scoring sys- 
tem was designed so that the lowest trentile deviation scores would reflect the 
highest levels of forecasting skill. The “perfect” score for a series of completely 
accurate forecasts would be zero if all observed values occupied single trentiles. 
Otherwise, the “perfect” score would be the sum of the deviational penalties. 
This would represent the minimum score and optimum performance for any 
forecast. 


+K 
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Forecasting skill as reflected by the normalized trentile deviation scores‘ 
may be gauged by comparing a forecaster’s average score for a number of fore- 
casts with the average score obtained by the use of methods requiring no spe- 
cialized training in meteorology. A forecaster could be said to possess some 
degree of skill if his average score were significantly higher (for the standard- 
ized trentile deviation scores) than the highest of the “unskilled” forecast aver- 
age scores. Such “unskilled” scores were actually calculated daily during most 
of the operation of the “Trentile Deviation” program. Scores, for example were 
computed for “chance” forecasts by selecting one of the 30 trentiles at random 
for each of the 25 forecast situations daily; for “climatological” forecasts, which 
were prepared by predicting the maximum frequency value for each of the five 
elements for each of the five cities each day; and for “persistence” forecasts by 
predicting that the weather element values would remain the same over the 
period of the forecast. 


THE SHORT RANGE WEATHER FORECAST VERIFICATION FORECAST FORM 


When the Short Range Verification Program was inaugurated, in April, 1943, 
the forecasters made 24-, 36-, and 48-hour forecasts for five selected stations 
(which were changed weekly), for atmospheric pressure, temperature, ceiling 
height, visibility, and precipitation, using blank forecast forms. 

After the program had operated for 20 weeks, a revised program was inau- 
gurated in October, 1943. A new short range forecast form, Figure 407, was 
devised. The new form supplied the forecaster with a certain amount of climato- 
logical background showing the climatological distribution of the elements to be 
forecast. This form also simplified the preparation and processing of the fore- 
casts, so that virtually the entire scoring procedure was now performed rapidly 
and economically by punch card machine methods. 

In addition, the requirements were changed (when the new program was 
initiated) to provide for the use of only one synoptic map daily instead of four 
and to provide for forecasts covering only one period weekly instead of three. 
The period of forecast was changed each week to include 24-, 48-, 12-, and 36- 
hour forecasts, in that order, in each four-week interval. The five stations were 
changed every four weeks. The same five meteorological elements were retained 
for the duration of the project. 


SCORING THE FORECASTS 


Each forecaster was requested to complete a minimum of three forecast 
forms each week. The forecast form (Figure 398) provides for forecasting five 
elements for each of five stations—a total of 25 separate predictions. Each of 
the 25 predictions was scored according to the “trentile deviation” system 
described above, and the forecaster’s total “trentile deviation” score for each 
form was obtained by summing these 25 individual forecasts. Since the scores 
for all elements and for all the stations were in terms of a uniform measure, the 
trentile scores for the separate stations and for the separate elements could 
properly be combined into a single total score. 





4 As noted in a later paragraph, the trentile deviation scores were subsequently converted to standard scores 
in order to equalize the effect of “easy” and “difficult” forecast days. 
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Since all forecasters in the continental United States forecast for the same 
five elements, for the same five stations, from the same synoptic maps (1230 Z), 
their scores in any given day were believed to be comparable. 

In the early weeks of the program each forecaster’s weekly (or “to date”) 
average score for a given length of forecast was obtained by adding his “trentile 
deviation” scores for the period in question and obtaining his average score per 
forecast form. Analysis of the daily distributions of “trentile deviation” scores 
indicated that the scoring system did not automatically adjust sufficiently for 
the variations in synoptic situations from day to day; so that some forecasters 
might be unduly rewarded while others might be overly penalized in the aver- 
age rankings as a result of the daily changes in the relative ease or difficulty of 
forecasting from the varying synoptic situations. (See reference [14] for a re- 
view of the results of the first six months of the verification program.) 

Accordingly, all “trentile deviation” scores for each day were converted to 
standard scores. In this way the scores for each day were made comparable 
with those of the other days in which forecasts of the same duration were made. 
Some 500,000 short range weather forecasts were evaluated from inauguration 
of the program until the end of World War II using the “trentile deviation” 
system. 
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WEIGHT-HEIGHT STANDARDS BASED ON WORLD 
WAR II EXPERIENCE 


BernarpD D. KarpPrinos 
Medical Statistics Division, Office of The Surgeon General, Army 


From data on height and weight (nude) of registrants for military 
service during World War II, relationships have been established be- 
tween weight and height. From these relationships, which may be con- 
sidered representative of the male population from 18 to 37 years of 
age, three weight-height tables have been prepared for White, Negro, 
and Total (White and Negro combined) men, for age groups, 18-19, 
20-24, 25-29, 30-34, and 35-57. Each table shows five expected-weight 
values (in pounds) for individuals of particular height (in inches) and 
age. The five values consist of a midvalue, i.e., the expected mean value, 
and two values on either side of the midvalue, comprising ranges of 
+1.2816 and +1.6449 standard errors of estimate. The weight-values 
are given for height-ranges from 59 through 78 inches, in terms of one- 
inch intervals. 


HE data utilized here are from a general study by the author on height 
er weight of registrants processed for military service in World War II [4]. 
The primary objective of this paper is to present weight-height standards 
based on that experience. The current widely-used weight-height standards for 
adults trace back essentially to the weight-height table of the Medico-Actuarial 


Mortality Investigation of 1909-1912, based on the height and weight of in- 
surees of more than four decades ago [1]—a fact often omitted in the various 
investigations. But—as is well recognized—weight and height change with 
changes in socio-economic conditions [2, 8]. 


NATURE OF DATA 


The data presented here relate to Selective Service registrants examined for 
military service during the thirteen-month period from January 1943 through 
January 1944. Somewhat over 5.5 million men, 18 through 37 years of age, were 
examined during this period by the Joint Army and Navy Induction Stations. 

Each registrant arriving at an induction station was ordinarily given a com- 
plete medical examination in order to determine his acceptability for military 
service. The results of the examination were recorded on a form (DSS Form 
221, “Report of Physical Examination and Induction”) which, in addition to 
the clinical evaluation of the registrant’s medical (physical and psychiatric) 
status, included related laboratory (x-ray findings, serologic tests for syphilis, 
etc.) and anthropometric data (weight, height). 

The induction stations were required to submit a copy of these physical 
examination forms to the office of The Surgeon General of the Army, for: (a) 
Each registrant who was disqualified for military service, and (6) each qualified 
registrant to be inducted into the Army. During this period about three-fourths 
of the qualified registrants called for induction were assigned to the Army and 
about one-fourth to the Navy. Copies of the physical examination forms for 
Navy inductees were not available to The Surgeon General’s Office. Adjust- 
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ments, explained later, were therefore made for this partial coverage of the 
inducted registrants in order to reflect the total examined registrant popula- 
tion. 

Each physical examination form carried a Selective Service number assigned 
to the registrant by his local board. From all the forms received during this 
period a ten-per cent random sample was drawn by choosing a single terminal 
digit of the Selective Service numbers and using all forms for which this was 
the terminal digit. About 465,000 medical examination forms were thus ob- 
tained, relating to individuals of known age and of known height and weight. 
A distribution of these forms by age, race, and military qualification (inducted 
or disqualified) of the individuals included in the analysis is shown in Table 
409. Age is given as of last birthday. The term “White” used in the racial 
classification refers to Non-Negro. 

The height and weight data for this study were taken from these sample 
forms. The examinees were measured without shoes and weighed without 
clothing. Height was recorded in inches and weight in pounds, as specified on 
the physical examination forms. 

With respect to measuring height, specific instructions were provided by the 
Army regulations on physical standards (Mobilization Regulations 1-9, 1942) 
which read as follows: 

“Use a board at least 2 inches wide by 80 inches long, placed vertically, and carefully 
graduated to } inch between 58 inches from the floor and the top end. Obtain the height 
by placing vertically, in firm contact with the top of the head, against the measuring 
rod an accurately squared board of about 6 by 6 by 2 inches, best permanently attached 
to graduated board by a long cord. The individual should stand erect with back to the 
graduated board, eyes straight to the front.” [11] 


Height was usually recorded in whole inches, though it was sometimes given to 
a half or quarter inch. Recorded fractions of an inch were dropped in coding the 
height data. Weight was presumably taken to the nearest pound. Fractions of 


TABLE 409 


NUMBER OF PHYSICAL EXAMINATION FORMS INCLUDED IN THE 
SAMPLE STUDY OF HEIGHT AND WEIGHT, BY AGE, RACE, AND 
MILITARY QUALIFICATION OF THE EXAMINEES 


(January 1943 through January 1944)* 








White Negro 





Age 
Inducted Disqualified Inducted Disqualified 





18-19 77,744 32,507 + 9,171 6,968 
20-24 67 ,824 29 ,971 FE 11,559 9,572 
25-29 40 ,934 28,971 6,661 9,446 
30-34 34,567 34,225 5,407 10,897 


35-37 16,303 22,891 2,535 6,422 











Total (18-37) 237 ,372 148 , 565 35 , 333 43 ,305 











* Source: Sample tabulations of DSS Form 221 (“Report of Physical Examination and Induction”), 
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pounds, when recorded on the forms were disregarded, and weight was coded 
in whole numbers as in the case of height. 


WEIGHT-HEIGHT RELATIONSHIP 


In order to establish the relationship between weight and height, cross tabu- 
lations of weight by height were made for each subgroup, as shown in Table 
409. Weight was expressed in the tabulations in 5-pound intervals, against 
height in one-inch intervals. 

Based on these bivariate distributions, the means, standard deviations, and 
covariances, required for correlation analysis, were computed for each sub- 
group. The logarithms of weight, rather than the actual weights, and the actual 
heights were used in these calculations. The choice of this procedure was 
prompted by the following considerations. First, the F test for linearity of re- 
gression (Kendall [5], p. 234) applied to some of the subgroups, by linearly 
correlating actual weights and actual heights, revealed significant departures 
from linearity. (A similar phenomenon was found by Rosenbaum [10] in his 
weight-height investigation of English recruits.) Second, the standard devia- 
tions of weight for each inch of height tended to increase with height for each 
subgroup, while the corresponding coefficients of variation proved to be fairly 
constant, thus suggesting semilogarithmic regressions. (See Kenney and Keep- 
ing [7], p. 210; O’Brien, Girshick, and Hunt [9], p. 70, state that an analysis 
of their data indicated a semilogarithmic relationship of weight to height.) 

Differences were found in the height and weight values of the various sub- 


groups, whether differentiated by age, or race, or military qualification (in- 
ducted and disqualified). 


ADJUSTMENTS FOR DISPROPORTIONALITY 


As previously indicated, copies of the physical examination forms of the 
Navy inductees were not available to The Surgeon General’s Office. Since dif- 
ferences were found between the weight-height values of qualified registrants 
(inductees) and those of disqualified registrants, direct additions of the bi- 
variate distributions of these subgroups—for the purpose of obtaining cor- 
responding distributions for the examined (inducted plus disqualified) groups— 
would have given undue weight to the disqualified subgroups and would have 
introduced thus a certain bias into the calculations. (For instance, for the 
(18-19) white age-group, the ratio of inducted to disqualified, as tabulated, was 
71:29, versus an “expected” ratio of 76:24 for qualified versus disqualified ; See 
Appendix, Table 417.) 

To correct for these disproportionalities, the various values:means, stand- 
ard deviations, and covariances derived from the bivariate distributions of the 
inducted and disqualified subgroups were weighted by their respective “ex- 
pected” ratios (Table 417). (See “Appendix: Technical Note” for the rationale 
and method of computation.) The assumption was made in this case that all 
qualified registrants were distributed by weight and height as the Army in- 
ductees, since neither height nor weight nor other related factors played a 
decisive part in the assignment of qualified registrants to the Army or the 
Navy. 
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TABLE 411 


COEFFICIENTS OF CORRELATION, REGRESSION EQUATIONS AND 
STANDARD ERRORS OF ESTIMATES OF WEIGHT (W) ON HEIGHT 
(H) OF REGISTRANTS PROCESSED FOR MILITARY SERVICE 
DURING WORLD WAR II, BY AGE AND RACE 


(January 1943 through January 1944)* 








; Regression Equations: Standard Error of 
Age and Race Correlations Loge cnthht Estimate (Log.) 





White 
18-19 ‘ .38468 +0 .01129H 0.05330 
20-24 ‘ .45443 +0.01048H 0.05579 
25-29 ‘ -48355 +0 .01029H 0.06126 
30-34 ‘ .50883 +0 .01003H 0.06363 
35-37 : .49814+0.01025H 0.06434 


Negro 
18-19 ‘ .39879 +0.01108H 0.04387 
20-24 ‘ .43190 +0.01082H 0.04455 
25-29 ‘ .42707 +0.01105H 0.04968 
30-34 t .43843 +0.01098H 0.05448 
35-37 , .45003 +0 .01088H 0.05774 





Total 
18-19 .4826 .38623 +0.01126H 0.05239 
20-24 .4419 .45210+0.01052H 0.05477 
25-29 .4039 .47730 +0 .01038H 0.06027 
30-34 .8855 .50139+0.01013H 0.06284 
35-37 .3860 .49341 +0.01031H 0.06379 











* Coefficients of correlation and regression equations were derived by correlating the logarithms of weight 
(Logi W—in pounds) with height (H—in inches). 


The correlation coefficients, regression equations, and standard errors of 
estimate, given in Table 411 by age and race (white, Negro), were derived 
from these adjusted values. 

With these adjustments, the derived weight and height values, by age and 
race (Table 411), may be regarded as representative not only of the examined 
registrants, but of all registrants within the age range, though they exclude 
registrants disqualified by the local boards prior to the induction processing, 
as well as deferred and exempt registrants. During this period the local boards 
played a minor role in the screening of registrants for military service, so that 
the exclusion of registrants disqualified by the local board could have not 
produced an appreciable distorting effect on the weight-height data. As to the 
deferred and exempt registrants, there is no reason to believe that their dis- 
tributions by weight and height differed materially from that of the other 
registrants. There are indications that such differences would be primarily a 
function of age. But these differences have been essentially eliminated by the 
fact that the weight-height values were derived by age. 

In combining the values by age, irrespective of race, a further adjustment 
was made, necessitated by the fact that the racial composition of the examined 





412 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1958 


registrants was different from that of all registrants or of the Census popula- 
tion. The additional adjustment was made by weighting the adjusted age-race 
values of the examined registrants by the Census ratios (Table 418). These 
adjusted values are given in Table 411, marked “Total.” 

The weight-height values presented here may therefore be considered as 
generally relating to men within the (18-37) age-range. 

It will be noted that the correlation coefficients of weight on height (‘Table 
411) tend to decrease with age for both white and Negro men, within the 
given (18-37) age-range. In other words, weight becomes patently less a func- 
tion of height as age increases. The correlation coefficients are higher for Ne- 
groes than for whites, for all age groups within the (18-37) age-range. 


WEIGHT-HEIGHT TABLES 


Three weight-height tables (Tables 412, 414, and 416) were derived from 
Table 411 for men within the (18-37) age-range, separately for White, Negro, 
and Total (White and Negro combined). The weight-values are for each inch 
of height, within the height-range of 59 through 78 inches, by the following 
age-groups: 18-19; 20-24; 25-29; 30-34; and 35-37. 

Five weight values (in pounds) are given in each table for each inch of height, 
by age, consisting of a midvalue (italicized) and two values on either side of 
the midvalue. The midvalue represents the expected average weight in pounds 
for individuals of the particular height within the specified age group. The two 
values closest to the mean (+1.2816 standard errors of estimate) indicate a 
range of weight that is expected to include 80 per cent of the individuals of the 


TABLE 412 
WEIGHT FOR EACH INCH OF "EIGHT: WHITE MEN, AGES 18-37* 








Weight (Pounds) Weight (Pounds) 
Height According to Age Height According to Age 
(Inches) (Inches) 








18-19|20-24/25-29130-24/35-37 


20-24'25-29|30-34135-37 








92} 96} 98] 99] 99 122 | 124 | 125 | 126 
96 | 100 | 103 | 104 | 105 128 | 130 | 132 | 133 
112 | 118 | 123 | 126 | 127 ¢ 151 | 156 | 159 | 160 
132 | 139 | 148 | 152 | 153 177 | 187 | 192 | 194 
138 | 146 | 155 | 160 | 162 186 | 197 | 202 | 205 
94 | 98} 100 125 | 127 129 
99 | 103 | 105 , 131 | 133 | 1: 136 
115 | 121 | 126 : 154 | 160 3 | 164 
135 | 143 | 151 182 | 192 5 | 199 
150 191 | 202 210 








97 | 101 128 | 130 132 
105 134 | 137 : 139 
124 158 | 164 168 
146 186 | 196 203 
153 195 215 
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TABLE 4i12—(continued) 





Weight (Pounds) Weight (Pounds) 
Height According to Age Height According to Age 
(Inches) er ————| (Inches) 
18-19|20-24|25-29|30-34/35-37 20-24|25-29)30-34): 
we 

99 | 103 | 105 | 106 | 107 131 | 133 | 134 
104 | 108 | 110 | 112; 113 137 | 140 | 141 
122 | 127 | 1382 | 135 | 136 162 | 168 | 170 
142 | 150 | 159 | 163 | 164 191 | 201 | 205 | 208 
149 | 157 | 167 | 172 | 174 200 | 211 | 217 | 220 














102 | 105 | 107 | 109 | 109 134 | 136 | 137 | 138 
107 | 110 | 113 | 115 | 115 141 | 143 | 144 | 146 
125 | 130 | 136 | 138 | 139 166 | 172 | 174 | 176 
146 | 154 | 162 | 167 | 168 195 | 206 | 210 | 213 
153 | 161 | 171 | 176 | 178 205 | 217 | 222 | 225 





105 | 108 | 110 | 111 | 112 138 | 139 | 140 | 142 
109 | 113 | 116 | 117 | 118 144 | 147 | 148 | 149 
126 | 1383 | 139 | 142 | 143 166 | 170 | 176 | 178 | 181 
150 | 157 | 166 | 171 | 172 194 | 200 | 211 | 215 | 218 
157 | 165 | 175 | 180 | 182 203 | 210 | 222 | 227 | 230 
107 | 111 | 113 | 114] 114 139 | 141 | 143 145 
112 | 116 | 119 | 120 121 146 | 148 | 150 153 
131 | 137 | 142 | 145 | 14 17 174 | 180 | f 185 
154 | 161 | 170 | 175 | 177 199 | 205 | 216 | 2: 224 
161 | 169 | 179 | 184 | 186 209 | 215 | 227 | 232 | 236 














110 | 113 | 115 | 117 | 117 143 144 | 146 148 
115 | 119 | 121 | 123 | 124 149 | 151 | 154 157 
135 | 140 | 145 48 | 150 175 | 178 | 184 | 187 | 189 
158 | 165 | 174 | 179 | 181 205 | 210 | 221 | 225 | 229 
165 | 173 | 183 | 189 | 191 214 | 220 | 233 | 238 | 242 








| 113 | 116 | 118 | 119 147 | 148 | 150 | 150 | 152 

118 | 122 | 214} 126 153 | 155 | 158 | 158 | 160 
| 188 | 148 | 149 | 152 180 | 1838 | 189 | 191 | 194 
162 | 169 | 178 | 183 210 | 215 | 226 | 231 | 234 
169 | 177 | 188 | 193 220 | 226 | 238 | 243 | 247 





| 116 | 119 | 121 | 122 151 | 151 | 153 | 154 | 156 

121 | 125 | 127 | 129 157 | 159 | 161 | 162 | 164 
142 | 147 | 153 | 156 184 | 187 | 193 | 195 | 199 
166 | 173 | 183 | 187 216 | 220 | 232 | 236 | 240 
174 | 182 | 192 | 198 | 200 225 | 231 | 244 | 249 | 253 






































* The weight values were computed from the respective regression equations of the logarithms of weight 
(logie W—in pounds) on height (H—in inches), by age. The italicized values represent the expected mean values 
of weight for the particular height. The values given above and below the mean values represent ¥1.2816 and 
¥1.6449 standard error of estimate, covering within this range 80 and 90 per cent of the population around the 
mean, respectively. The first range is expected thus to eliminate 10 per cent and the other 5 per cent of the popula- 
tion of the particular height, on either side of the given range. The basis for these values are height and weight data 
on white registrants processed for military service during World War II (January 1943 through January 1944). 
The examined registrants were measured without their shoes and were weighed with their clothing removed. 
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TABLE 414 
WEIGHT FOR EACH INCH OF HEIGHT: NEGRO MEN, AGES 18-37* 








Weight (Pounds) Weight (Pounds) 
Height According to Age Height According to Age 
(Inches) (Inches) : 

| 








18~19]/20-24/25-29/30-34 18-19 20-24 25-29)30-34 35-37 


96 | 99; 99 123 | 127 | 128 | 128 





99 | 103 | 104 128 32) kk 134 | 134 
118 | 120 j J 157 | 159 


128 | 134 | 139 ‘ j ‘ 185 | 188 
193 | 198 








98 31 | 132 | 131 | 131 

137 | 137 
116 i 59 | 161 | 163 
132 : ; 189 | 193 
198 | 203 





134 
141 
166 
194 

203 





138 

144 
169 
199 
208 








141 
148 
174 
204 
214 











145 
152 
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TABLE 414—(continued) 





Weight (Pounds) Weight (Pounds) 
Height According to Age Height According to Age 
(Inches) (Inches) l 

18—19)20—-24 ssiiuat wale. a0-20)25-20)08-08 ‘ 














117 | 121 | 122 | 122 156 | 157 | 156 
122 | 126 | 127 | 127 161 | 164 | 164 
138 | 144 | 147 | 149 184 | 190 | 192 
158 | 164 | 170 | 175 203 | 210 | 220 | 226 
163 | 170 | 178 | 184 218 | 229 | 236 








120 | 124 | 125 | 125 159 | 161 | 160 
125 | 129 | 130 | 130 j 166 | 168 | 168 
142 | 147 | 161 | 163 6 189 | 196 | 197 
162 | 168 | 175 | 180 215 | 225 | 232 
168 | 174 | 182 223 | 235 | 242 












































* See Footnote to Table 412. 


particular height. In other words, 40 per cent of these individuals are expected 
to fall within this weight-range below the mean weight, and 40 per cent above 
the mean weight, with 10 per cent falling outside of either of these ranges. The 
other two weight values, farther removed from mean value (F 1.6449 standard 
errors of estimate), indicate an expected coverage of 90 per cent of the particu- 
lar height-group. (See footnote to Table 412.) 

These are, of course, arbitrary ranges of weight values not an attempt to 
define “normal weight,” “underweight,” or “overweight.” In this respect, these 
ranges of values resemble those in the Medico-Actuarial Mortality investiga- 
tion of 1909-1912. The latter ranges are values arbitrarily chosen as 20 per cent 
under or over the observed average weight [1], although it has become com- 
monplace to consider the values within these ranges as “normal,” and those 
outside these ranges as “underweight” (more than 20 per cent under the 
average weight) and “overweight” (more than 20 per cent over the average 
weight). 

It should be kept in mind that the weight and height measurements pre- 
sented here were made in the nude, while the applicants for insurance involved 
in the 1909-1912 study were measured in their shoes and weighed in their 
“ordinary indoor clothes, with coats sometimes removed” [1]. 

In this connection the foliowing facts might be mentioned: 

(a) The average height of the examined registrants of World War II was 
68.0 inches and their average weight was 150.5 pounds. 

(b) The average height of the inductees of World War II was 68.1 inches 
and their average weight was 150.5 pounds. 

(c) The inductees of World War II were on the average 0.6 inches taller 
and 8.9 pounds heavier than those of World War I [3, 4]. 


APPENDIX: TECHNICAL NOTE 


As stated in the text, adjustments for disproportionality were made in de- 
riving the coefficients of correlation for the examined groups (inducted plus 
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TABLE 4i6 
WEIGHT FOR EACH INCH OF HEIGHT, BY AGE: MEN, AGES 18-37* 








Weight (Pounds) Weight (Pounds) 
Height According to Age Height According to Age 
(Inches) (Inches) 
vedi wallets woven: 30-34|35-37 20—24/25-29/30-34 











92 | 96 98 | 99 | 9S 122 | 124 | 125 
96 | 101 | 103 | 104 128 | 131 | 132 
| 112 | 118 | 123 | 126 5 151 | 158 | 159 
131 | 139 | 147 | 151 | 15: 177 | 187 | 191 
137 | | 155 | 159 | 16 185 | 196 | 201 
95 9 | 100 | 101 125 | 127 | 128 
99 | 103 | 105 | 107 | 1: 131 | 134 | 135 
115 | 121 129 154 | 160 | 162 
135 2} 150 | 155 181 | 191 | 195 
141 | 158 | 163 5 190 | 201 | 206 
97 | 101 | 103 | 104 129 | 130 | 131 
101 | 106 | 108 | 109 135 | 137 | 138 
| 118 | 24 | 129 132 | 13: 158 | 164 | 166 
138 | 146 | 154 | 158 | 16 186 | 196 | 200 
144 | 153 | 162 | 167 ¢ 195 | 206 | 211 


| 100 | 103 | 105 |. 106 132 | 134 | 134 
| 104 108 | 111 | 112 ‘ 138 | 140 | 141 
62 Was. | 127 | 182 | 186 , 162 | 168 | 170 




















142 | 150 | 158 | 162 | 16 190 | 200 | 205 
148 | 157 | 166 | 171 199 | 211 








102 | 106 | 108 | 109 135 | 137 
| 107 | 111 | 113 | 115 | : 141 | 144 
| 125 | 130 | 135 | 138 | 166 | 172 
| 146 | 153 | 162 | 166 195 | 205 
| 152 | 160 | 170 | 175 204 | 216 





| 105 | 109 | 110 | 111 140 
| 110 | 114 | 116 | 117 | 147 
64 | 128 | 133 | 139 | 141 176 
| 149 | 157 | 166 | 170 | 172 ¢ 210 
| 156 | 164 | 174 | 179 | 9 | 221 


BE Seat Seen Boos Sig 





| 108 | 111 | 113 | 114 143 
113 | 116 | 119 | 120 151 
65 | 131 | 137 | 142 | 146 | 180 
| 153 | 161 | 170 | 174 215 
| 160 | 168 178 | 183 226 


ae oe si 


| 


111 | 114 
115 | 119 
66 | 136 | 140 





147 
155 
185 


164 | 172 232 























| 
| 
| 
oft « 
157 | 165 | 221 
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TABLE 416—(continued) 





Weight (Pounds) Weight (Pounds) 
Height According to Age Height According to Age 
(Inches) yl (Inches) . | 
9|/20-24| 25—29/30-34| 35-37 18-19|20-24125-29|30-34/35-87 











119 | 119 : 147 151 | 152 
125 | 126 27 154 159 | 161 
149 | 151 ‘ 179 191 | 194 


118 
178 | 182 209 d| 226 | 230 | 234 


138 
| 161 
| 169 





— 
SIO. Ne 


NOt Ns 


187 | 192 ¢ 219 243 | 247 








154 | 156 
164 
198 


121 | 122 151 

128 | 129 158 
152 | 155 184 
182 | 187 ¢ 215 
192 | 197 224 


Noo 





oO amme 
“It. bo bo 
23 


























ae 
| ta 
| 168 
| 17 


| 
| 
| 





* See footnote to Table 412. 


disqualified registrants), by age and race, and by age alone (irrespective of 
race). These adjustments were necessitated by the following facts. First, no 
copies of the physical examination forms were received for Navy inductees. 
Consequently, direct additions of the bivariate distributions of the inductees 
by height and weight to the bivariate distributions of disqualified registrants 
would have given undue weight to the disqualified subgroups and would thus 
have introduced a bias in the calculations, since differences were found between 
the weight-height values of the subgroups. Second, the racial composition of 
the examined registrants was different from that of all registrants or the Census 
population for that period. Since there were differences in the height and weight 


TABLE 417 


RATIOS OF INDUCTED TO DISQUALIFIED IN SAMPLE VERSUS 
EXPECTED RATIOS OF QUALIFIED TO DISQUALIFIED 








White Negro 





Sample’ Expected? Sample! Expected? 








Dis- Quali- Dis- 


In- Dis- Quali- | Dis- In- 
mean fied qualified 


ducted | qualified) fied | qualidied ducted | 











20-24 .6935 .3065 .7491 | .2509 .5470 
25-29 .5856 .4144 .6508 | .3492 .4135 
30-34 -5025 .4975 .5712 | .4288 .3316 


.4530 .5721 .4279 
.5865 .4385 .5615 
.6684 .3546 .6454 
.7170 .3042 .6958 





35-37 .4160 .5840 .4844 -5156 . 2830; 











18-19 . 7052 . 2948 .7593 | . 2407 .5683 | .4317 .5931 .4069 
| 














1 Based on Table 409. 

? Derived from reports (“Monthly Report of Examination of Joint Army and Navy Inductees”) submitted 
monthly during World War II to the Surgeon General's Cffice by each induction station. These reports showed, 
by race, the number of registrants examined, number disqualified (by disqualifying cause), and the number of 
inductees by type of qualification and department of assignment: Army or Navy. 
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values by race, additional adjustments were required for this disproportionality 
in order to obtain values as close as possible representative of the total popula- 
tion (white plus Negro). 

The adjustments for disproportionality were computed in the following man- 
ner: 

(a) Examined group (inducted plus disqualified), by age and race. The data 
were adjusted by weighting the computed separate values (means, standard 
deviations, and covariances) of these subgroups by the “expected” ratios as 
given in Table 417, by age and race. 

(b) Examined group (white plus Negro), by age. The data were adjusted by 
further weighting the adjusted values derived in (2) by the white to Negro 
ratios given in Table 418, by age. 

The computations of the weighted values (means, standard deviations, and 
covariances) were as follows: 

Let X; and Y; represent the respective height and weight means of a sub- 
group, and X and Y the mean values for the corresponding total group. Obvi- 
ously, 


X¥=>owX;i (G=1,---,k) (1) 


where w;, is the proportion of the subgroup within its total group ( }-w;=1.0). 
For the (20-24) white age group, for instance, the w,’s are .7491 and .2509 
(Table 417), when combining the respective inducted and disqualified sub- 
groups. For the same age group, the w,’s are .8996 and .1004 (Table 418), when 
combining the corresponding white and Negro subgroups. The same relation- 
ship (1) holds, of course, for Y. 

The variance V(X) of the total group is: 


V(X) = Sw f[V(X) + F242] -—F%* @=1,---,k) (2) 


where V(X,) is the variance of the subgroup; X, X; and w,; are the same as in 
(1), above. The same holds for V(Y). (See: Kenney [6], p. 101, corollary 1.) 


TABLE 418 


RATIOS OF WHITE TO NEGRO IN THE ADJUSTED SAMPLE VERSUS 
THE RATIOS OF WHITE TO NEGRO IN THE MALE 
POPULATION, BY AGE 

















Ratios 
Age Adjusted Sample Male Population* 

White Negro White Negro 
18-19 .8875 .1125 .8950 . 1050 
20-24 .8422 .1578 .8996 .1004 
25-29 .8314 . 1686 .9047 .0953 
30-34 .8254 .1746 .9049 .0951 
35-37 .8279 .1721 -9096 .0904 

















* Based on special communication from the Bureau of the Census relating to 1943. 
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Following essentially the procedures used by Kenney [6] in deriving weighted 


variance (2), the following relationship was established for weighted covariance 
(Cov): 


Cov (XY) = >> w,[Cov (X.:Y¥) + X¥.:¥:] —XVY ( =1,---,k (8) 


where Cov (X;,Y,) is covariance of the subgroup. The other values have been 
explained in (1), above. 
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ESTIMATION OF SURVIVORSHIP IN CHRONIC DISEASE: 
THE “ACTUARIAL” METHOD* 


Lita ELVEBACK 
Tulane University 
In most follow-up studies in chronic disease assumptions on the 
parametric form of the survivorship function are not justified. A simple 
frequency estimate, arising from polygonal approximation, is dis- 
cussed. It is shown that, for proper limitation of the period over which 
the survivorship is to be estimated, bias due to nonlinearity may be 
controlled, without loss in efficiency, by proper choice of the sub- 
intervals. It is concluded that the method proposed is appropriate and 
highly efficient for large scale follow-up studies such as those carried 
out by the Mayo Clinic. 


HE problem of estimation of the survivorship function in follow-up studies 
Te chronic disease is characterized by the fact that some part of the sample 
observations are not complete over the entire period under study, and also by 
the necessity of avoiding an assumption concerning the parametric form of the 
survivorship function. 

The accepted and almost universally used method of dealing with this situa- 
tion has consisted in the subdivision of the total period into a set of small sub- 
intervals, and the estimation of the conditional probability of survival over 
each of the sub-intervals. This method, which has been in use for many years, 
has been called the “life table” or “modified life table” method. In 1950, it was 
designated as “actuarial” by Berkson [3], in order to distinguish it from a direct 
calculation frequently employed by clinicians. Since that time this designation 
has gained fairly wide acceptance in the literature on follow-up studies [2, 4, 
13, 14, 16, 17], and will be adopted here. 

Recently a variety of alternative methods of estimating the survivorship 
function have been proposed [13, 14, 17]. Some of these are based on assump- 
tions concerning the functional form of 


P(t) = Prob (survival to time ¢ following entry) 


over the entire period under study, and are unsuitable for use in most chronic 
disease studies. Others, while completely non-parametric in this respect, involve 
clerical and computational work which becomes prohibitive in large studies. 

It has been pointed out that actuarial methods are inefficient alternatives to 
the one period estimator when the form of the survivorship function is assumed 
to be known. This is true, but does not detract from the appropriateness of 
the method for the majority of chronic disease studies in which the form of the 
survivorship function is, in fact, unknown. 

It is the purpose of this paper to propose a simple actuarial estimate based 
solely on the interval frequencies and to illustrate its performance. The con- 
clusion is reached that the method proposed constitutes a very satisfactory 





* Most of this work was carried out during a Fellowship in the Section of Biometry and Medical Statistics of 
the Mayo Clinic. Acknowledgement is due to Dr. Joseph Berkson for generous help in all phases of the werk. 
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procedure for estimation of the survivorship function over a large class of 
chronic disease studies. 

For the purposes of this paper the actuarial method is taken to be one which 
is appropriate when the functional form of P(t) is unknown, and in which 


1. the period (0, 7’) under study is partitioned into a number of sub-intervals 
(0, th), ~~ e (tat, ta), 


where 4,=T. 
the parameters to be estimated are the conditional probabilities 


bo 


1 — qx = Prob (survival to k + 1| survival to k) 


for k=0,1,---,a-—1. 
3. P(t), for k=1, - - - ,a—1, is estimated as 


k—-1 
II (1 big Gi), 


t=) 
where J; is the estimate of qj. 


The method presented involves, in addition to the properties above, the 
approximation of the unknown P(t) by the polygonal function 


P(t) = 1 — qot for +. 2. 


and by 
k-1 
P(t) = 11 -—(t- Ba] 1 — qi) fork <t<k+1landk =1,2,--.,a-1. 
t=O 
Linearity over the sub-intervals is chosen as a natural first approximation to 
the unknown P(t). In general the lengths of the intervals should be chosen in 
such a way as to insure the adequacy of the linear approximation, and in most 
studies intervals of unequal lengths are suggested. It will be seen from the 
results below that the common practice of using a calendar year for each 
interval will lead to satisfactory results in many studies. For notational con- 
venience and simplicity of the numerical examples, the intervals here have been 
chosen to be of equal length. In general, the order of the approximation may 
be controlled by choice of the number and lengths of the intervals, 

The estimates suggested were developed for use in the studies conducted by 
the Mayo Clinic and are appropriate to the method of follow-up routinely used 
there. Since some of the other estimates which have been suggested [13] cannot 
be used for the sample information resulting when that method is used, a 
description of this follow-up method and an alternative method of using the 
sample information is necessary. 


THE FOLLOW-UP PROCEDURE 


The group of patient histories studied is made up of all cases in the disease- 
treatment group defined [7] and entering that group (at onset, diagnosis, 
treatment or discharge) at the clinic over some specified period. For example, 
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the group might consist of all patients undergoing and surviving gastric re- 
section for the removal of malignant neoplasm of the stomach who were dis- 
charged from the hospital during the period between January 1, 1935 and 
December 31, 1944. For each patient the date of entry into the study is taken to 
be the date of his discharge from the hospital. 

In some cases a single study of the group of patients is made. In the example 
above, if the study were initiated in January 1947, information concerning the 
experience of the patients subsequent to discharge would, in general, be at hand 
only for those patients who had returned to the clinic for treatment in the 
interval between discharge and the initiation of the study. The follow-up pro- 
cedure would consist of sending letters of inquiry to each member of the group, 
with the exception of course, of those who were known to have died before the 
opening of the inquiry. The process of mailing the letters might cover a period 
of several weeks. The period during which replies would be received, second 
letters written, and further attempts to trace the patient undertaken might 
cover from three months to a year. At the conclusion of this period the members 
of the original group would fall into the following classes: 


a,) those reported to be dead, date of death known 

b,) those who reply and are reported living at some date during the inquiry 

period. 

In some cases the follow-up is brought up to date every few years. Further 
inquiry might be made in 1953 concerning the subsequent experience of the 
group b, above. The 1953 study might include, in addition the new cases enter- 
ing the group in the interval from January 1, 1945 through December 31, 1950. 
In the 1953 inquiry, letters would be sent to members of the group }; and to 
those entering in 1945-1950. At the close of the new inquiry the total group is 
classified as: 

a) those reported to be dead, date of death known 

b,) those reported alive at some date during the 1953 inquiry. 

Two methods of using the sample information are described below. 


The Common Closing Date? 


The follow-up method described above results in observational periods which 
differ from individual to individual not only in length but also in the calendar 
date of last report. The dates of last report, with very few exceptions, fall 
within the inquiry period and may be spread out over most of a calendar year. 
The common closing date method? of using the sample information corresponds 
to choice of a single calendar date (which might be on or before the opening 





1 In addition there may be a few cases in which the patient is untraced or is known to have survived a number of 
years following entry, but for whom al] attempts to discover his status during the inquiry period fail. This group is 
kept to a minimum in the Mayo Clinic studies, in some of which complete follow-up has been achieved for 99% of 
the study group. In the discussion which follows, it is assumed that untraced or partially traced histories occur in 
negligible instances only. If the untraced or partially traced group is large and non-randomly “selected” the results 
will be biased regardless of the method of estimation used. 

2 This method is not ordinarily used at the Mayo Clinic. 

2 It is to be noted that the procedure used here is not the usual application of a common closing date to infor- 
mation obtained from a study based on the anniversary method, in which patients are followed to an anniversary 
of the date of entry, but not longer than some specified date. No common closing date is used, but each patient’s 
history is terminated at his own last anniversary of entry prior to the study date. It will be noted that if the “anni- 
versary method” were to be used in connection with data arising from the inquiry method of follow-up described 
here, that the resulting loss in discarded sample information might be greater than that associated with use of the 
study date as a common closing date. 
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date of the inquiry) which is used as the closing date for the experience of every 
individual in the study. For each individual alive on the common closing date 
the information at hand concerning subsequent experience (from the common 
closing date to the date of last report for that individual) is discarded. 

In the example described above the sample information resulting from the 
1953 inquiry might be used to establish, for each patient, his status as of Janu- 
ary 1, 1953. The longest history might cover the period between entry on 
January 1, 1935 and the closing date January 1, 1953 and the shortest the 
period between December 31, 1950 and January 1, 1953. The sample informa- 
tion becomes 


a) the calendar date of death for those who die before January 1, 1953, which 
gives the time of death as measured from entry. 

b) the survival time or the interval between entry date and the closing date 
for those who survive to the closing date, January !, 1953. 


Under this method a patient who is reported as living on August 1, 1953 and 
another whose date of death is known to be July 1, 1953, are both recorded as 
living on January 1, 1953. Some part of the observed experience of each patient 
who survives beyond January 1, 1953 is discarded. In the study described, since 
there are no entries after December 31, 1950, no information concerning sur- 
vival over the first two years following entry is discarded. 

In an effort to obtain a rough estimate of the relative cost in discarded in- 
formation which is associated with the application of a common closing date to 
observations arising from the inquiry type of follow-up described above, a 
study was made of 1559 Mayo Clinic patients who entered the follow-up ex- 
perience at the date of diagnosis of angina pectoris during the years 1940 
through 1944. The common closing date was taken as February 15, 1950, the 
date of first reply received during the 1950 inquiry. It follows that there was no 
information discarded due to termination of the study during the first five 
years of experience. A total of 619 deaths occurred during this time. At the be- 
ginning of the sixth year following diagnosis, 940 of the patients were still alive 
and under observation. Of these 221 were observed to die before the common 
closing date, and the temaining 719 were alive on the closing date, February 15, 
1950, and survived to teport at some point beyond this date. A total of 264 
years of observation, or an average of .36 year per person alive on February 15, 
1950, was discarded by use of the common closing date. On the basis of a rough 
calculation the number of years of observation used in the estimation of 
(qs - - +, Qs) was 2670, and the information discarded was roughly nine per 
cent of that available. 

In the example above in which a 1947 study of the 1935-45 entries was ex- 
tended and brought up to date in 1953, it could have happened that in 1953 
no information was obtainable on a more than negligible number of the indi- 
viduals reported alive at the 1947 closing date. The presence of such unknowns 
in the follow-up would clearly preclude the use of the common closing date 
method as presented here. 


The Date of Last Report 


The inquiry method of follow-up which has been described is that ordinarily 
used at the Mayo Clinic. However, the common closing date method of han- 
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dling the sample information is not used in the analysis of the results in these 
inquiries. Instead all information on each member of the group available up to 
the last report on that individual has been included in the sample information. 
As stated previously, it is assumed that for patients living at last report, the 
date of last report is, in practically all cases, within the period of inquiry. When 
the inquiry is completed this information consists of two calendar dates for 
each person: 


1) the calendar date of entry, and 
2) the calendar date of death 


or 


1) the calendar date of entry, and 
2) the latest date on which the patient is known to have been alive. 


The sample information is taken to be 


a) the time of death as measured from entry, 


or 


b) the time of withdrawal alive, which is the length of the interval between 
entry and the last date on which it is known that the patient was living. 


The date of last report method has the considerable advantage that all avail- 
able information is used. It has been suggested that the inclusion of experience 
subsequent to the beginning of the inquiry period will produce bias since the 
date of last report is, in a sense, selected by the person concerned. If so, the 
magnitude of the effect might be investigated empirically for each study. Since 
it is to be expected that the amount of bias introduced will depend on the 
disease in question, on the form of the inquiry letter and on many other factors, 
no single investigation would provide a basis for generalization. It is the 
opinion of the writer that the “selection bias” introduced by the inclusion of 
the post closing date information is a minor matter. 


The Sample Information Under the Common Closing Date Method 


For each person in the follow-up group let the length of history be H;=C—E; 
where C is the common closing date, and £; the entry date of the ith patient. 
If H;<T, let T;=H;—[H;], where [H;] is the greatest integer contained in Hj. 
If the ith patient survives to [H,] he is referred to as a withdrawal in the inter- 
val ((H;], [H;]+1), and if further he survives to H;=[H,;]+T7; he is referred 
to as a withdrawal alive at time 7; in this interval. 

For each person in the group let r; be the interval between entry and death. 
If +;>H;, its value is not observable. If 7;<H; let t;=7;—[7,;] and the patient 
is referred to as a death at time ¢; in the interval ([r;], [7;]+1). 

Given M individuals in the total group studied. With reference to the typical 
interval, (k, k+1), for k=0, 1, - - -, a—1, the individuals can be classified as 


N;, =the number, among those for whom H;>k, who survive to k. 


The N,; survivors to k can be further classified as 
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W, =the number, among the N; for whom k<H;<k+1 
=the number of withdrawals in (k, k+1) 
=Dyu+L,, where 
Ds, =the number who die before withdrawal 
L,,=the number who survive to withdraw alive 
N,—W,=the number, among the N;, for whom H;>k+1 
=the number of non-withdrawals 
os Die tSy where 
Dy =the number who die during the interval (k, k+1) among the . 
non-withdrawals, and 
S,=the number who survive to k+1=Ny4,4). 


Then Ni; = ( Du +8S;,) + (Dut+Lx), and for Dz = DutDu, Ni = D, -b Lk -t Sk. 


ESTIMATION OF THE INTERVAL PARAMETERS 4; 
Estimates for the Common Closing Date Method 


For the purposes of discussion of the relative merits of different estimates 
of the qx, the quantities Ni, W, and 7), - - - , Tw, are analogous to “sample size” 
and will be taken in this section as given constants. The sample observations 
are Dy and Dy and the times of death, ZL, and the times of withdrawal, and 
Sk = News = N; <= D, —- Tx. 

Since, for ¢ on the sub-interval k<i<k+1, 


Prob (survival to t| survival to k) = 1 — (t — k)q, 
then, for any ¢ in the interval 
Prob (death in ¢, t¢ + dt| survival to k) = qdt. 


It follows that the contribution to the likelihood function for a death during 
the interval, among the N—W persons whose histories are complete over the 
interval, is simply q: regardless of the time of death as measured within the 
interval. The assumption that the survivorship function may be satisfactorily 
approximated by a straight line over the interval is equivalent to the assump- 
tion that the information is essentially complete when the time of death is ob- 
served only in terms of the interval in which it occurs. 

The sample information for the interval is taken to consist of N,.— Wy, ob- 
servations on a binomial variable of parameter g., resulting in a contribution 
to the likelihood function of 


qe? *(1 — qu)N*-We-Dik, 
and a single cbservation on each of W, binomial variables of parameters 


qT, +> +, &T w,, the typical, or jth, of which results in a contribution to the 
likelihood function of 


qT’; if death occurs between k and k + T;, 


or of 


1 — qT; in the event of survival to k + T). 





426 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1958 


Choice of the conditional probabilities q as the parameters to be estimated, 
leads to a likelihood function which can be written in the form 


a—1 


o=Ile 


where 


Dok Lk 
vi = ge? **(1 _ g)rrrntl II wT.|| II -.. at) |. 
ro! 


m= 
It follows that 
0? lo 
g ? =O 
99:04; 


for all i#j, and the asymptotic covariance matrix of the maximum likelihood 
estimates (go, - ~~, 9a-1) is diagonal, so that all asymptotic covariances be- 
tween the §; are zero. 

The asymptotic variance of §, is given by 


gt | -1 
(a(t) 
0q;? 
For notational convenience in the discussion of the typical interval, (k, k+1) 


and the estimation of q, the subscript k will be dropped. The maximum likeli- 
hood estimate,‘ g, is the solution of 


L , 
‘ —__—} = 0 l 


and must be evaluated by an iterative procedure. Further 
0 lo D N-D-L . Te . 
i oreo Red epee >(- ) 
dq’ Vi (1 — q)? 1 — qT. 


7 





4 


Ww 
" k log ‘) _ (N -— W)q+ & aT “ (N — W)(1 — g) 


dq? ¢ (1 — q)? 


w T; 2 
+ 2 [ (a) 0 - | 


N-W w T;? T; 
= + z( +=), 
ql — q) 1—q7T; 4q 


N-W 18/ 7; 
Var g)-! = +— ( : ). 3 
widen qi-q) 4 2 1 — qT; mn 


‘ This estimate was fret proposed by Berkson, on the basis of a heuristic argument [4]. See Appendix, Note 2. 
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The clerical and computational work is greatly simplified if the estimate is 
based solely on the interval frequencies. Several such estimates have been sug- 
gested and used in application [1, 2, 3, 4, 8, 9, 13, 16, 18]. A discussion of their 
properties must be carried out within the framework of a model which includes, 
in addition to an assumption on, or an approximation to, P(t) over the interval, 
information concerning the distribution of the 7; for the withdrawals. No 
direct set of observations on the 7’; under the common closing date method has 
been made in connection with the Mayo Clinic data. However, for the date of 
last report method the 7; for the withdrawals alive have been determined for 
some 1250 cases, of which 450 were taken from a study of cancer of the stomach 
and the remainder from a study of angina pectoris. For each disease several 
typical intervals were used. The resulting set of six distributions of times of 
withdrawal alive showed no marked or systematic departure from that ex- 
pected under a uniform distribution of the 7; values for all withdrawals, which 
is given by 

q 


1 
T) = _ 7 ren 0 < T < 1. 
Km) 1—@¢/2 1-—@q/2 es rolls 





Use, as a averaging device, of the assumption that the 7; arise as observa- 
tions on a uniformly distributed random variable will be made in the following 
discussion. It is believed that this assumption will serve as an adequate ap- 
proximation in most follow-up studies of the type under discussion. 

This simplifying assumption gives, for the W withdrawals 


Prob (death before withdrawa! time) = q/2. 


If the 7, observations are censored, and this averaging device is adopted, 
then the sample information is regarded as 


N — W observations on a binomial variable of parameter gq, 
and 


W observations on a binomial variable of parameter qg/2. 


q De q L 
= g?Pi l1— yr-o-n( 2) (1 ain +) 
o=qril-gq 9 9 


and the maximum likelihood estimate corresponding to this simplified set of 
observations is® 


Then 





l LS chil ne 
qi = aw 2 -L+D—VJA(2N —- L + D)? - SDN |. (4) 


Again the asymptotic variance is given by 


(-o("asy" 


* This estimate was also proposed by Berkson as an approximation to the solution of (1). 
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loge D N-D-L L 
dq’ q’ (1 — gq)? (2 — q)? 
q 
N-—-W)q+w— 
[= log ‘) * ( q 2 (N — W)(1 — q) 
dq? ” ¢° (1 — q)? 











and 
q(l — g)(2 — q) 


W\ 
v(2 —-q- x) 
N 


It is of interest to include in the comparison the frequency estimate 


D 
Ge ee 
y= — 


| 


~~ 





Var Gt = 


discussed by Littell [13]. Under the same assumptions used above, since D is 
the sum of two independent binomial variables, 


q q\ 
, i y\ am za exes. 
(N — W)q(l — 9) + W — (1 - ) 


Var D, + Var D. 2 


(N — W/2)? (N — W/2)? 





Var qw = 


To evaluate the loss in efficiency due to discarding the T; values for the 
withdrawals alive, the variance of the maximum likelihood estimate corre- 
sponding to the use of the entire set of observable random variables has been 
evaluated by use of the same simplifying assumption on the distribution of 
the 7;. From (2) and (3) 


; N-W W f'! 
[Varg! = +— f — 
q(1 — q) q “0 
N-W W fo + log (1 )] 
a7. CC 
Under linear survivorship the asymptotic efficiency of the frequency esti- 
mates is high, even for large g and a large proportion of withdrawals. For 
example, if g=.2 and W=.2N, the efficiency of g, and gw are .987 and .984 
respectively. If g=.5 and W=.5N, the corresponding efficiencies are .962 and 
.927. Note that the use of gw is possible only if the common closing date is used. 


Estimation Under the Date of Last Report Method 


If all of the available information is used, the H; values are no longer defined. 
For those who are alive at the date of last report, H,’ may be taken as the in- 
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terval between entry and date of withdrawal alive. If H,’/<T =a, then for 
some k 


Hi =k+T; where 0<7;<1 


and the individual is referred to as a withdrawal alive at 7; in the (k+1)st 
interval, and is a member of the group L;. For those whose date of death is 
known, no similar “observation time” or “length of history” is defined. The 
sample information for the typical interval consists of N, D, Land T,, ---, Tr. 
The frequency W is no longer observable and the D; and D, groups are no 
longer distinguishable. On the other hand, the information required for the 
use of either g of (1) or gz of (4) is available. However the evaluation of the 
asymptotic variances cannot be carried out without further assumptions on 
the lengths of history or observation times. In the model discussed below the 
common closing date method and the date of last report method correspond to 
different systems of censoring of the sample information. 


A GENERAL MODEL 


For each member of the total group, the length of history, H, is taken to be 
an observation on a random variable, independent of the time of death, and 
with distribution given by 


P(H >t)=1-—pod for 0<t<1, 
and by 


k—-1 
P(H > t) = [1 — (t— kee] J] 1 — 0) for k < t Sk +1 and 
. t= (0 


px = Prob (H <k+1|H>k). 


The p are parameters to be estimated. The time of death r, is governed by the 
survivorship function 


P(t) =1—qt for 0O<t<1 

and 
b—-1 

P(t) = [1 -( -—Ha] [] 0 —@) fork <t<k+1andk=1,---,a—1. 
t=O 

and is observable only in those cases in which 7<H. 


The probability, 
P(t) = Prob (survival to ¢), must be distinguished from 


Q(t) = Prob (survival to ¢ and under observation at ¢) 
= Prob (H > t and survival to #). 
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Fix and Neyman [10], in describing a competitive risk model* for use in chronic 
disease studies, refer to probabilities such as P(t) as “net” and to probabilities 
such as Q(t) as “crude.” Note that 


P(k) = [Tl -@)), 


whereas the expected number of persons alive and under observation at k, or 


k—1 , 
E(N,) = M JJ (1 — qd — 0) = MQ(R). 


Under the general model, given survival under observation to k the prob- 
abilities of the observable events are, for k<t<k+1 


Prob (withdrawal alive in t,t + dt) = p.x(1 — qat)dt 
Prob (death observed in é, ¢ + dt) = qx(1 — pxt)dt. 


Again the choice of the conditional probabilities g, and p, as the parameters 
to be estimated, leads to a likelihood function which can be written in the form 


a—l 
¢= II de, 


kenQ 


Dé Le 
a = fe — px) (1 — a)}| II ad — ait) || IT p.(l — uTn) | 


n=l m=) 


¢ can also be factored into functions of p, and q separately. It follows that 
all asymptotic covariances between estimates (go, - - - , Ga—1, Bo, - - - , fas) are 
zero, and again the reciprocal of the asymptotic variance of 4 is 


7 


0? log ¢ 


For the typical interval, dropping the subscript k, and for N persons alive 
and under observation at the beginning of the interval, the maximum likeli- 
hood estimate @ is the solution of 

dlog@? D N-D-L > ( Te ) > 


oq q l1—q pra 


1 — qT 





* The model described above is equivalent to a competitive risk model in which the ititensities of risk of death 
and of withdrawal alive, rp(t) and rz(t) respectively, are, for k <t <k+1, given by 


Qk 
rp()dt = Prob (death in t, ¢ + dt|survival tot) = dt, 
1 — at 


Pk 
ee Sere > eae dt. 
— prt 


The quantity rp(t) is also referred to as the “force of mortality” and is related to the net survivorshiy by 


d 
rp(t) = — — log P(t). 
dt 
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which corresponds exactly to the solution of (1). Also 
0° log ¢ ‘D N-D-L £ ie s 
- 2( aa *) BE) sae Br Su >(— ~_)} 
« ae q’ (1 — q)? m=i \1l — QT» 
1—p/2 1- . rr 2 p(1 — qT 
= N ail +— < + o(1 _ yf (~ ) o( : ar , 
ee l—q 2/J3,\1—q@T/ pli —@/2) 





—s p 
(Var g)'=N Pali fics lq + log (1 -o)t. 
Wa-o ¢ 
Finally, removing the condition on N; persons alive and under observation at 
time k, and replacing N; by its expectation. 
, 1 — px Pk one 
(M Var gy = {——— ta + tog (1 - I TL = a0 - po. 
gq(l — qe) q* i=0 
For the typical interval, the maximum likelihood estimate of p is the solution 


Frequency Estimates 
For the frequency estimates of g and p of the typical interval, since 


Prob (death observed during the interval) = q(1 — p/2) 
Prob (withdrawal alive in the interval) = p(1 — q/2), 


the joint sample probability becomes 
@ = [(1 — e)(1 — g) }$[a( — 0/2)]? [od — g/2)]*. 


The solution of d log ¢/dq=0 gives, as the maximum likelihood estimate of q, 


the quadratic estimate 9, of (4). Similarly 


l ame 
- [2N —-D+L— J/QN —D+L)? — 8LN I. (5) 





ir = - 


(2 log ¢ jD N-D-L L 
—E( ——— } = E%- ——— 
( ) y* (1 — )? +e@-o 


oq? 
; y 1 — p/2 1 —_ p p/2 
ay firearm e, oy 
gee l1—-q 2-¢9@ 





and, for N given 
$2 | 


Var Gu 
N(2 — q — p) 
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Again, removing the condition that N; is given, and denoting by ¢1, the quad- 
ratic estimate of q 





is a k-1 
qe(l — qu)(2 qt) where E(N,) = M[[(1—q)(1—p,). (6) 


Var gx, = 
* (2 — ae — a) E(N:) ind 


The Effect of Non-Linearity of the Survivorship Function 


In general application to large stucies in chronic disease, in which the form 
of the survivorship function is unknown, the actuarial method is taken to be 
the method of choice. The question as to how the (go, - - - , ga—1) Shall be esti- 
mated remains. With reference to the choice of the frequency estimate gz of 
the linear model the following questions arise: 


1. How sensitive is gz to non-linearity of P(t) over the sub-intervals? 

2. What is the cumulative effect of the bias of gz on the estimate P(T) when 
P(t) is non-linear? 

3. What is the effect on the variance of P(T) of using a larger number of 
smaller intervals in order to decrease the bias? 


Concerning the last of these questions, Littell [13] discusses an actuarial esti- 
mate of P(7), namely one based on gw, for a model in which the functional 
form of P(t) is known to be exponential, and for which P(t) is estimated over a 
period (0, 7) on which all of the available histories become incomplete. He 
concludes that the efficiency of the actuarial estimate is low, and that subdi- 
vision into many small subintervals is to be avoided on the grounds that the 
desired decrease in the bias of P(T) is accompanied by a rapid increase in its 
variance. 

Since in many chronic disease studies the assumption of an exponential P(t) 
over the entire period (0, 7’) may be inadequate, it is of more interest to com- 
pare the performance of the proposed actuarial estimate based on g, with the 
optimal estimate for the case in which P(t) consists of a series of exponential 
ares over the sub-intervals. To avoid extrapolation the period (0, 7’) is re- 
stricted so that some histories complete over the entire period are available. 


The Alternative Model 


Following the method of Fix and Neyman [10] and taking here for the 
alternative model, 
p 


ro(t) =n and rz(t) = 
1 — pt 


the required probabilities become, for the typical interval, and 0<t<1, 


Prob (survival under observation to t) = (1 — pl)e~*' 
Prob (death before t) = (1 — e~**)(1 — p/n) + pte 
z. (1 — e**) 

0 
Prob (death observed in ¢, t + dt) = (1 — pt)ye~*‘dt 


Prob (withdrawal alive in t, t + dt) = pe~*‘dt. 


Prob (withdrawal alive before t) = 
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The maximum likelihood estimate of q for the typical interval is 


D 


L D 
S+ > T, + bm 4 





g=1-—e* where f= 


and the asymptotic variance is 
(1 — q)*u? 
E(D) 
where E(D) is the expected value of D, and for N given 


E(D) = N | oc —~9+ a(1 i‘ *)|. 


Again, removing the condition that N is given, 


Var g = 


314 (1 — qx)? ux? 
Var ¢& = ———_—_ 
E(D,) 


where 


B(D:) = | ox(t — @) + a(1 ~ 2 |r Ia-p)a-@. ©) 


& Mk i=0 
Since gz is biased under the alternative survivorship function, the measure 
of efficiency which has been chosen is 
N Var @ N Var @ 
~ N Var gz + NB? N[MSE 4] 





Ew 


where B, the asymptotic bias of 31, is gz*—q, and qz* is the result of replacing 
D and L in (4) by their expected values. Note that, since the bias of g, is inde- 
pendent of N, the efficiency, for fixed g and p, is a decreasing function of N. 

For a sub-interval on which p=}, u=.2, g=.1813, an interval “longer” in 
terms of p and q than would ordinarily be used, the efficiency’ of gz relative 
to g, for N = 1000 is 


; .189796 
E000 == a = 95. 
.197244 + 1000(.00136)? 





If the interval is shortened to ».=.1 and p=, the corresponding efficiency, for 
N = 1000, is .99. 

Let the estimate of P(7T) based on the use of the maximum likelihood esti- 
mate (7) on each sub-interval be P(g, 7), and that based on the quadratic 
frequency estimate (4) on each sub-interval be P(gz, 7’). The efficiency measure 
is taken as the ratio of the variance of Pq) to the MSE of P(x). All variances 
and biases are, again, asymptotic. The variances of the g, are given by (8). The 
asymptotic variances of the gz, and of the two estimates of P(7') have all been 
obtained by use of the “delta method” [6, pp. 353-66]. 

The bias of 

a—l 
P(x, T) = I] 1 — x) 
k=O 


1 For variance @z under the alternative see Appendix Note 3. 








AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1958 


Bl P(gx, T)] = | I (l — a) | APE) 


where each @,, is the result of replacing N, D and L im the estimate as given in 
(4) by the expectations of the appropriate Ni, D, and Ly. 

The expression for the asymptotic variance of §z,, explicit in p and y, is 
lengthy and of no practical value from a computing standpoint. The values of 
the Var @:, have been evaluated numerically, step by step, as described in 
Note 3 of the Appendix. 

The “delta method,” as applied to an estimate 


a--1 


P(T) = [J G — 4,), where 9; is an estimate of qi, 


t= 
ives,® f values of th toti variances betw ; and 
gives,° for zero vaiues 0 e asymptotic covariances between 9; and 9;, 
on Vor G 


Var P(T) = [P(T)j? >}. ————_ (9) 
=0 (1 ” qi)? 


which is known as Greenwood’s approximation [11]. This expression has been 
used for the asymptotic variance of each of the two estimates of P(T). 

The increase in the variance of the actuarial estimate with increasing number, 
a, of sub-intervals observed [13] in the case in which, for each a, pa_1=1, does 
not occur here with (0, T) restricted so that some part of the histories are com- 
plete over (0, 7’). The interval (.87,, T) corresponds, for a=5, to the last inter- 
val, for a= 10 to the last two intervals, and for a=20, to the last four intervals. 
The contribution to M Var P(g:, T) of this part of the total period decreases 
with a, from .1923 for a=5 to .1901 for a=20. 

For each a, the bias of g; is maximum on the last sub-interval (a—1, a). 
The bias on the last interval decreases from .00625 (approximately .02¢) for 
a=5, to .00017 (approximately .002g) for a=20. 

It is of interest to note that, if the familiar estimate 


Vu. = aoe (10) 


is used on each sub-interval, and P(7) is estimated as Tl (1—@y,), that the 
bias of the resulting estimate of P(7’), is, for each value of a, approximately 
double that of the estimate based on @,,. The approximation, @;,, is not con- 
sistent under the general linear model. Since §z, is based on the same set of fre- 
quency observations, and shows better performance, the approximation (10) 





* Since gz is biased under exponential survivorship, (9) becomes 
. - Var @ 
Var P@z, 7) = (PCM) S ——*- 
(1 —@z,)* 


= a—l 
P(T) = 7 (1 - G,)- 
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will not be discussed further, although for use with intervals corresponding to 
small values of both gq and p it will fulfill the accuracy requirements in some 
applications. The performance of the estimate of P(T) based on the use of gx 
on the sub-intervais, relative to that based on use of the maximum likelihood 
estimate on each sub-interval, is summarized in Table 436. 


The Estimation of P(t) Under the Alternative Model 


In the illustration of the performance of g, under the alternative model, the 
net survivorship is taken to be P(t) =e~*“‘ on (0, 7), and uJ =2.0, so that P(T’) 
= .135335. The estimates based on @, are discussed for a=5, 10 and 20 sub- 
intervals. The p, values® have been chosen to approximate situations met in 























application. For each value of a, p.=5/[a+5(a—k)] for k=0, 1, ---, a—1. 
TABLE 435 
VALUES OF THE PARAMETERS USED IN THE ILLUSTRATION 
No. of 
intervals Po Pi P2* * * Pos Pa q=l—e-w 

A 

5 1/6 | = «1/5 1/4 1/3 | /2 .32968 

10 1/12 1/11 1/10---1/4 | 1/3 18127 

20 1/24 1/23 1/22---1/6 | 1/5 09516 











In the illustration, for any given a, the q values do not change from one sub- 
interval to another. For each value of a, [{ (1—p:) =%, and of the total of M 
patients, M$P(T) =.0225 M are expected to survive under observation to time 
T. This schedule involves fairly heavy withdrawal rates, particularly in the 
first few intervals. In practice, the entries of the last few years are not used so 
that for the first two to five years of follow-up there are no incomplete histories. 

The bias of the estimate P(g, T) due to non-linearity of the survivorship 
function P(t) =e~“*, is essentially eliminated by division of the period (0, T) 
into a=20 sub-intervals. The reduction in bias is accomplished without loss 
in efficiency. 


CONCLUSIONS 


The actuarial method is appropriate for a large and important class of follow- 
up studies of survivorship in chronic disease. The frequency estimate, @z, arising 
as the maximum likelihood frequency estimate from the approximation of P(t) 
by a polygonal function is a very satisfactory estimate of the interval prob- 
ability. The clerical and computational work involved in its use is simple. It is 
appropriate for use with either the common closing date or the date of last 
report method of recording the sample information. The results presented are 





* Note that the schedule of withdrawals corresponding to the sets of p values used here are those which would 
result from use of a common closing date applied to uniform entries, for which R /M =} of the histories are complete 
to T, and (M —R) /a =w histories become incomplete on each sub-interval. Then R +(a —k)w histories are complete 
to k, and of these a proportion p, =w/[(R+(a —k)w] become incomplete on k, +1. While the pk values chosen for 
the numerical example correspond to uniform entries, this is not a requirement under the general model, since the p, 
are not subject to this restriction. 
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TABLE 436 
e—l 


VARIANCE, BIAS AND EFFICIENCY OF P(@z)=[] (1—9:,) AS AN 
ESTIMATE OF P(T) 
































Number of sub-intervals a=5 a=10 a=20 
Sub-interval* q = .32968 q= .18127 q= .09516 

M Var PG, T) . 30298 . 32005 . 32488 

M Var P(@z, T) .33117 .32812 .32699 
B[P@:, T)] .00322 .00081 .00009 
E,o0[P (Gx, T)] . 887 .973 .993 

Eso00[P (qx, T)] . 836 .970 .993 

* For each a, the valueof q is the same for all sub-intervals. ‘i ue p; values for each a, and for k=0, 1,- ++ ,a—1 


are given in Table 1LE 


all based on large sample approximations and on asymptotic variances. Suitable 
restrictions must be placed on M, the total number of histories, and on the 
number of histories complete over the entire period (0, 7). 
The conditions necessary for satisfactory results with the actuarial estimate 
Pz) are simple. 
1. The actuarial method, designed for the case in which the form of the 
survivorship function is unknown, does not lend itself to extrapolation. 
The period (0, 7’) over which P(é) is to be estimated must be restricted so 
that it is covered by a reasonable amount of experience under observation. 
As a practical rule, it might be recommended that T be chosen so that at 
least 25 persons be observed to survive the interval (0, 7’). 
2. Since approximation to P(t) by a polygonal function is involved, the sub- 
intervals must not be too “long” in terms of p and gq. 


If the first of these conditions is met the bias may be controlled by increasing 
the number of intervals with a resulting gain in efficiency. Maximum interval 
lengths corresponding to g=.10 and p=} might be recommended for general 
use, although in some studies maximum values of g=.2 and p=} will give the 
accuracy required. 


APPENDIX 

Note 1. An Example 
The first four columns of the table below are taken from those given by 
Parker, Dry, Willius and Gage [16, page 97, Table 3], and are based on dates 


of last report. 
The authors base their estimates of P(t) on the interval estimates (10) 


« dD: 
i. = 
Ni - — 
9 


_ 


which gives a fifteen year survivorship of .143, whereas the use of g, on each 
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interval leads to an estimate of .141. The intervals of one calendar year used 
are rather “long” in terms of the q and px. 


SURVIVORSHIP IN ANGINA PECTORIS 






































1 2 | “ee 5 6 7 8 
Last Report SE? iz) 
x N | D L au P(t, zx) p 100 — 
| | | P(a) 
me a eae SEE RE REY OTERO TS ENS, oat 
0 2418 456 | 0 | .18859 | 1.0000 | 0 
1 1962 226 39 | .11642 | .8114 -0211 1.0 
2 1697 152 22 .09018 .7170 .0136 1.3 
3 1523 171 23 .11318 | . 6523 .0160 1.5 
4 | 1329 135 24 .10256 | .5784 .0190 1.8 
5 1170 | 125 107 .11228 .5191 .0969 2.0 
6 938 83 133 .09560 .4608 . 1489 2.2 
7 722 74 102 .11078 .4168 . 1496 2.5 
8 546 51 68 .09996 .3706 .1311 2.9 
9 427 42 64 . 10682 .3336 . 1583 3.2 
10 321 43 45 . 14491 . 2979 .1511 3.7 
11 233 34 53 . 16659 . 2548 . 2481 4.4 
12 146 18 33 } 14035 . 2123 . 2431 5.4 
13 95 9 27 | 22162 | . 1825 .3010 6.5 
14 59 6 23 . 12845 . 1622 .4166 7.6 
15 30 .1413 9.5 
Column 
1. 2=Years following diagnosis at clinic. 
2. N =Number of persons alive at beginning of interval. 
3. D=Number of deaths observed in the interval (z, r+1). 
4. L = Number of withdrawals alive in the interval (z, x +1). 
5. op from equation (4). 
6. P(@, z) =II (1—gz,); and 7. p from equation (5). 
8. Var gz evaluated from equation (6) and 


SEp,z) from Greenwood’s approximation, (9). 


Note 2. Other Frequency Estimates 


Berkson [4] arrives at equation (1) heuristically by “considering q to be given 
as the observed deaths plus the deaths expected among the living withdrawals 
after withdrawal, divided by the total number living at the beginning of the 
interval,” or by setting the estimate, for linear survivorship as 


l L 
q= y {p + >> [Prob (death in T,,, 1) | survival to Tal} (1A) 


4 


which leads to the likelihood equation (1). 
Littell [13] arrives at his “effective persons exposed” estimate, for exponential 
survivorship as the solution of 


1 w 
q = ~{P + Er > [Prob (death in 7, yk (2A) 


where Ey denotes expectation with respect to 7 which is uniform over the 
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interval. Under linear survivorship (2A) gives gw=D/(N —W/2), and, if ex- 
pectation with respect to 7’ is dropped, gives 


Ww 
tw = p/|n -yu- r) |. 


These estimates are available only if a common closing date is imposed. 
It was surprising to the writer to find a survivorship function for which the 


maximum likelihood estimate of g involves only N, D, W and the 7), - - - , Tw. 
Let 
l1-q 
P(t) = ——-— forO <ti< 1. (3A) 
1—q+¢q 
Then 





vine Laeh lt ake lb ts 2 


—~q+qTn l~q+q?a 


and 





aq q i-% 1—q+ qT; 








a lo D N-D Ld 1 — T; 
HB NPs E(t 0 


which is equivalent to the expression resulting from dropping the expectation 
with respect to T in the second term on the right of (2A), or to 


1 — {0 — 90 — T)) 

omy [Ptad( *) | 
N 1— q+ QT; 

It is interesting to note that, for this survivorship function, (1A) gives 


D 
————_— (4A) 


#2 - ‘ 
N = _. ( l Secs Tm) 
If expectation with respect to the T distribution were introduced in the second 


term on the right of (1A), the estimate of (4A) would reduce to the familiar 
form (10) 








q= 


i (5A) 


The estimate g,, of (5A) has been widely used [2, 3, 16, 18] as an approximation 
to 9x, which is the solution of the equation 


D 





qu = L 


epee 


2— gi 


Note that 4, is consistent under the survivorship function (3A). 
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The survivorship function of (3A) corresponds to a decreasing intensity of 
risk 





and is an interesting addition to P(t)=e-“' and P(t)=1—qt which have con- 
stant and increasing intensities respectively. 


Note 3. The Asymptotic Variance of the Estimate 4, Under Exponential Survivor- 
ship and Linear Withdrawals 

On the typical interval, for N given, and N=D+L-+S, 41 is a function of 
the random variables D and L. 

Let 





1 ee 
gz = f(D, L) = : [2N —-L+D—/2N —L+D)*?—8DN] (6A) 


and let f[E(D). E(L) | denote the result of replacing D and L by their expected 
values E(D) and E(Z), in the right hand side of (6A). Expansion to linear 
terms, of f(D, L) about E(D), E(L) gives 


f(D, L) — f[E(D), E(L)| = [D — E(D)|fo* + [L — E(L)|fi* (7A) 


where fp* and f,* are the respective partial derivatives of f(D, L) evaluated at 
E(D), E(L). Squaring and taking expectations on both sides of (7A) 


E{f(D, L) — f[E(D), E(L)]}? 
=> (fp*)? Var D + (f1*)? Var L + 2fp*fr* Cov (D, L). (8A) 


For gx, and for D and L observations from a multinomial, the right hand side 
of (8A) may be taken [6, pp. 353-66] as the asymptotic variance of @z. 
For 
q* = f[E(D), E(L)|] and A*®? = [2N — E(L) + E(D)]* — 8NE(D) 
fo* = (2 — q*)/A* and f,* = q*/d* 


and 
1 
Var gu = Aw { (2 — q*)? Var D + (g*)? Var L + 2q*(2 — q*) Cov (DL)}. 


In the computations throughout N has been replaced by the appropriate ex- 
pectation of N;, that is by 


k-1 
E(Ni) = M II (1 -- ps)(1 — qi). 


The resulting variances are identical with those given by considering each §1, 
as a function of three random variables D,, Ly, and G, = M — N; having expecta- 
tions 
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k—1 
E(G,) = M-—M{[] i — 1) — gi) = M — E(N,) 
E(D,) = E (1 —- *) + p(1 — «)| E(N,) and (9A) 
Kk 
PRIK 
E(L,) = —- E(N,). 
Me 


Note that M =@4+Dit]IatNiws 
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1, INTRODUCTION 


HE real or fancied difficulties in computing M. G. Kendall’s tau coefficient 

[1] have retarded its general use as a measure of correspondence between 
rankings. However, when tau is considered as a coefficient of disarray, that is, 
as a measure resultiug from the least number of moves required for trans- 
forming one ranking into the other, it will yield to simple graphic interpretation 
and computation. The formula for tau as a coefficient of disarray is 





2s 
T= es —y 
()) e 
2 . 
which may also be written as 
, 4s (2) 
. n(n — 1) 


where 


s = the minimum number of interchanges which will transform one of 


the sample rankings into the other, and 


1 
()=5n@-D 


441 
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2. HOLMES’ GRAPHIC COEFFICIENT OF CORRELATION 


Back in the 1920’s, a Canadian educator, 8. D. Holmes, now text-book 
editor, Ontario Provincial Department of Education, Toronto, developed a 
graphic method for demonstrating correlation [2]. Although Mr. Holmes’ 
purpose was merely to use this graphic method as an illustrative teaching 
device, he carried it through to the development of the formula 


4X 


~ an — 1) 


(3) 


r= 1 


where 
r, = the graphic coefficient of correlation, and 
X = the number of intersections resulting from connecting the two rankings 
of each member by lines, one ranking having been put in natural order. 


Note the similarity in structure of formulas (2) and (3). 

Neither 8. D. Holmes nor his mentor, Peter Sandiford, were aware that a 
formula for rank correlation had been uncovered. They were interested in r, 
as an estimator of the Pearson product-moment correlation coefficient. When 
during the years 1929 to 1931 I sent Sandiford the results of studies made of 
r, as an rank-order correlation coefficient, he discouraged publication as being 
of little practical value. Our uncertainty of its standard error of estimate 
seemed particularly disturbing to him. 


3. GRAPHIC COMPUTATION APPLIED TO A SIMPLE RANK-ORDER PROBLEM 
3.1. Basie Procedure 


Let us apply Holmes’ graphic method to an illustrative problem given by 
Kendall [1], p. 7f. Consider the two rankings of seven: 


A 1 2 3 4 5 6 7 
B 6 3 5 7 1 20 4 (Problem 1) 


When writing this down we should leave enough space between the lines for 
satisfactory revelation of the intersections we shall make. Thus, 


A | Su Ba 4. 2 6...7% 
X= TRUM Ml =\3 
B= -0.238 


a ° 8.3. Bid oe ee 


Connect similar ranks with lines, which need not be perfectly straight, counting 
intersections as the lines are drawn. We obtain 13 intersections, which, sub- 
stituted in formula (3), gives r,= —0.238. This agrees with the result obtain- 
able by the traditional methods of computation. It is apparent that Holmes’ 
r, is the same as Kendall’s 7, and that the graphic method can be used in ob- 
taining tau. Consequently, formula (1) may be used for the graphic computa- 
tion of tau, letting s be the symbol for the number of intersections. 
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3.2. Modification when Correlation is Patently Negative 


If one set of ranks be reversed while the other is fixed, the value of tau 
changes in, and only in, sign. This reversal may be done to simplify computa- 
tions when preliminary inspection indicates the value of tau will be negative. 
The array in Problem 1 makes it apparent that correlation is negative. In such 
cases as this, some labor can be eliminated by reversing the order of one line. 
Thus, 


When one line or axis is reversed, change the sign of the value of the tau 
obtained. 


4. TIED RANKS 
4.1. Ties in One Ranking Only ’ 


In case of tied ranks, the graphic method presents few difficulties. In ap- 
proaching the problem of tied ranks graphically, we might assume in each case 
of ties that differences exist, but the measuring device failed to distinguish 
these differences. A literal acceptance of this statement might suggest that we 
construct a series of charts for all possible ways of assigning integral ranks to 
the tied measures, compute tau for each, and take the arithmetic mean of these 
values as the average coefficient. If there were merely one pair of tied ranks, 
this could be done, for there are only two possible arrangements for a pair of 
ties. From here on, difficulties are compounded, since there are six possible 
arrangements for three objects, 24 for four, 120 for five, and 720 for six! How- 
ever, in practice, if the ties, irrespective of complexity, are in one ranking only, 
we need construct but two graphs, each eliminating the tied ranks—one mini- 
mizing, the other maximizing possible differences. Then we total the intersce- 
tions from the two graphs, thus obtaining 2s to use in formula (1). An example 
will make the process clear. Let us consider these two rankings of seven: 


A 1 2 3 4 5 6 7 
B 2 1 4} 43 43 7 4} (Problem 2) 


When we minimize differences, the rankings become 
l fr ae ee 2 


4 sicher 


ee le oe oo 








If we maximize differences, we obtain 


At? 22—e? 
NY i Tae 
’ \ 

B21 &6 47 3 
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From the above we get 2s= 10, that is, s, +82; and r,= +0.524. 
If one seeks the refinement of Kendall’s adjustment for tied ranks [1], 
Chapter 3, the graphic method supplies the needed information, since 


s - (*) 2 (4) 
os . — 2s, 


where S is a symbol used and defined by Kendall. It is the summation of the 
totals of positive scores (P) and of negative scores (Q). This provides the 
numerator in Kendall’s alternative formula for 7 with tied ranks [1], p. 35, 2d 
ed., which he writes as 


S 


= ' ’ 
Vv {4n(n — 1) — T}V{4n(m - 1) — VU} (5) 





Tb 


where 


l 


T = > t(t — 1) or one-half the summation of the various sets of ties 
t 


bo | 


in one ranking, and 


1 
U= Zz > u(u — 1) or one-half the summation of the various sets of ties 


u 


in the other ranking. 
Personally, I prefer a restatement of formula (5) as 


S 
m= , (6) 


VAQ)- EG) VQ) - EQ) 


which makes it possible to substitute values of (3), (;), and (2) for all elements 
in the denominator of formula (6) directly from section 5 of this paper. This 
“Table for (3), and for (4) and () in Calculating T and U with Tied Ranks” is 
one which I wrote on an end page of my copy of Kendall’s Rank Correlation 
Methods. It is elementary, but I find it convenient. We shall apply formula (6) 
and data from section 5 to Problem 2. There are no ties in the A-ranking, so 
the first half of the denominator within its braces becomes 21 —0=21. There is 
one 4-fold tie in the B-ranking, so the second half becomes 21—6=15. Hence, 





11 
n° 
Vv (21 X 15) 

= + 0.62 
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4.2. Ties in Both Rankings 
Where ties are found in both rankings, it is best to construct four graphs— 


minimizing and maximizing the differences for each ranking. Consider these 
two heavily-tied rankings of ten: 


A 2 2 2 45 45 8 8 8 8 8 
B 5 2.5 1 5 75 25 9 5 10 7.5 (Problem 3) 


Minimizing both, we obtain 


AlT?s 4°53 6786 7 
| 5=mm=10 


Pn SF . 
B42157396608 
Maximizing both, 


A 3 2 1 5 4 10 9 8 7 6 
B 6 3 1 5 8 2 9 4 10 7 


but note: In all methods for obtaining tau graphically, it is vital that one of the 
two rankings be in natural order. Therefore, as both rankings have been dis- 
arranged in maximizing differences, it will be necessary to make suitable adjust- 
ment before drawing our lines. Thus: 


v 


Alt23456789 10 


5, = TRIM =\7 
Bisé65 70492 
Minimizing A, maximizing B: 
Ai 2?sa23.64 78 9 
5, = WWM =\5 


S63 3 S822 9 4 6 7 
Maximizing A, minimizing B: 


A 3 2 1 5 4 10 9 8 7 6 
B + 2 1 5 7 3 9 6 10 8 


Again, a rearrangement is necessary: 


Al1l234567890 
4, = TKI = 2 


Biz2zavsSewuoe6e?s 


Therefore, 4s =54 and 2s=27. By formula (1), 7, = +0.40, and by formula (4), 
S=18. By formula (6) and section 5, 
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18 
 V{45 — 84+1410}/{45 —-34+14+D} 
18 
~ (31 X 40) 
= + 0.511 


Construction of more than four graphs in solving any rank-correlation prob- 
lem involving two variables appears to be unjustified. 
4.3 Proof that the Graphic Method for Ties gives Kendall's S' 


A simple proof that the graphic method for ties will yield Kendall’s S may 
be had as follows: Let T be the number of pairs of corresponding ranks for 
which at least one tie appears. Then 


(()-P+ a, 
’) = Q+7, 


and 
S=P-Q 
= (°) 2(Q + 47) 
~‘\g 
Also, 
4s = 4Q + 27, 
28 = 20+ T 
= 2(Q + 3T). 


That 4s=4Q+2T may be seen as follows: consider each pair of corresponding 


c ’ d 


Without loss of generality, we may suppose a <b. Then consider the six possible 
sign relationships, as in the following table: 





1 For the proof I am indebted to William H. Kruskal, who has also contributed a number of other improve- 
ments to the paper. 
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Contributions to 





| 

| | 

| Q T Min A- | Min 4- | Max A- 
| 

















a<b | e2d Max A- 
| < Min Bs Max Bs Min Bs Max Bs 
| 
< | <.1) 4 0 0 0 0 0 
< = 0 1 0 1 0 | 1 
site Vlae 1 0 1 1 1 1 
= < | 0 Onl 0 
| ee pissMued sot 0 I 1 0 
do ee 0 ; 1 1 1 0 | 0 








But in each row the contribution to 4Q+2T is the sum of the contributions to 
the four s’s. (Actually only the first three rows of the table are really needed.) 


Therefore, 
n 
S = ( ) — 2s. 
2 


This proof provides at the same time forma! proofs for the simpler cases, with 
ties in only one or in neither ranking. 


5. TABLE FOR (3), AND FOR ($) AND ($) IN CALCULATING 
T anD U WITH TIED RANKS 























—o- ——— a — 

n (3) n (3) | n (3) n (3) 
2 l 15 105 27 351 39 741 
3 3 16 120 28 378 40 780 
4 6 17 136 29 406 41 820 
5 10 18 153 30 435 42 861 
6 15 19 171 31 465 43 aQ3 
7 21 20 190 32 496 44 946 
8 2 21 210 33 528 45 990 
9 36 22 231 34 561 46 1035 

10 45 23 253 35 595 | 47 1081 

il 55 24 276 36 630 18 1128 

12 66 25 300 | 37 666 49 1176 

13 78 26 325 | 38 703 | 50 1225 

| 


14 91 
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A NEW BIVARIATE SIGN TEST 


IsaporE BLUMEN 
Cornell University 


A sign test for the hypothesis that the medians of two or more varia- 
bles (not necessarily independent) have a particular value is described 
in detail for the bivariate case. The slopes of the vectors from the 
bivariate median to the n sample points are ordered and we let a;= +1 
according as the vector corresponding to the jth slope is above or be- 
low a horizontal line through the median. The bivariate test is based 
on the statistic v* =2(v,2+02%)/n, where v: >a; cos (xj/n), v2 = Ya; sin 
(xj/n), and the sum is over the integers j=1, 2,---, n. The statistic 
has the chi-square distribution for large samples. The power of this 
test relative to other possible sign tests and Hotelling’s T? is discussed. 


1. BACKGROUND 


ET us suppose that there are two measurements of an individual’s behavior 
y in a particular situation. In a factory, for example, one measurement might 
be an index of the quality of the individual’s output while the other might be 
the‘quantity produced. Fifteen machines are chosen at random and the first 
and second shift workers are to be compared. 

If we compare the workers on only one variate, say quality, the problem is 
a standard one [2]. For if there is no effect of shift and if the workers have 
been chosen at random from some relevant population, the quality of the out- 
put recorded on the first shift and the quality recorded on the second shift 
could be viewed as independent chance variables which have the property 
that the median of the differences is zero. That is, if we write a plus whenever 
quality is better on the first shift and a minus whenever it is better on the 
second shift, the fifteen signs can be taken as coming from a binomial distribu- 
tion with n=15 and p=}. 

If quality on one shift were better than that on the other, the median of the 
differences would be different from zero and the signs would follow a binomial 
distribution with p~¥}. Hence, a reasonable test procedure for the hypothesis 
that there is no shift effect is to reject the hypothesis whenever the number of 
plus or minus signs is excessive. 

This test is the grandfather of most nonparametric procedures and shares 
their virtues and vices. From our point of view, the principal virtue is that it 
makes no requirement that the behavior of quality scores on different machines 
must be the same. In fact, it does not even require that the variability of qual- 
ity scores for workers on a particular machine be the same, except that these 
operators have scores which vary about some common level for the machine. 
A machine whose operators produce a highly variable product will not, there- 
fore, dominate this test as might happen if one used the raw differences. (Of 
course, to the extent that the differences actually are homogeneous the sign 
test may not make most efficient use of the data.) 
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If we are to compare the two shifts on both quantity and quality we face an 
added complication in the fact that these are related measures for which the 
relationship is not simple. This dependence makes it impossible to perform sign 
tests on quantity and quality separately and treat the tests as independent. 
An extreme example will make the point clear: Suppose that whenever there is 
a plus on quality differences there must be a plus on quantity differences and 
similarly for minuses. A decision to accept or reject a hypothesis on quality will 
then imply the same decision about quantity. 

In practice, of course, the association is not this close and extra information 
is gained by looking at both variates. When the differences are from a common 
normal distribution, an appropriate statistic is Hotelling’s T? [1]. 

In the event that the requirement of normality cannot be met by the data, 
a bivariate sign test may be appropriate. It is possible to devise many such 
tests and before describing the one which we recommend it may be worth look- 
ing at a few. 

We begin by plotting the differences on a graph. The simplest test then is 
probably to divide the plane up by two lines through the origin. There will be 
four “quadrants” and the two pairs of opposite quadrants can be separately 
considered. There will be n,; points falling into the first pair, nz into the second. 
Of the ni, 8; will fall into one of the quadrants and s; is approximately normally 
distributed under any reasonable restrictions on the way the lines are drawn. 
In particular, we can draw the lines so that n,; =n/2 (for convenience, assume n 
even) and so that the angle containing the n,; observations is as small as possible. 
Then it is not hard to show that 

Ne 3 
4 


Ne 





(1.1) 


has, approximately, a chi square distribution with two degrees of freedom under 
our null hypothesis and will tend to be large when the null hypothesis is false. 
The test then is to reject when L? is large. Unfortunately, although easy to 
apply, the test has much less possibility of picking up the fact that our null 
hypothesis is false than the V* test proposed later since we effectively use only 
half our observations for each possible direction of shift. 

A test proposed by Hodges [3] is not much more difficult to apply. He sug- 
gests drawing lines through the origin and counting the number of observations 
falling on either side of each line. One looks for a line for which the smallest 
number fall on one side and bases the test on this number. Put crudely, one 
relies on the observations to find the combination of quantity and quality least 
favorable to supporting the hypothesis and then applies a sign test. Tables for 
Hodges’ procedure for many reasonable combinations of sample size and sig- 
nificance level may be found in [3]. The theory for the procedure is still in- 
complete. A heuristic argument for the point of view that the procedure has 
relatively little power is given in the last section. 
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2, THE TEST PROCEDURE 

The univariate sign test involves the reduction of the problem of testing the 
hypothesis that the median of a distribution has a particular value to a bi- 
nomial probability problem. It does this by taking as the basic data for the 
statistical test only the directions in which the original observations differ from 
the hypothesized value. There are two possible directions, to which plus and 
minus signs are attached. Our bivariate generalization is based on the direc- 
tions of the vectors from the hypothesized value to (zi, yi). 

In the particular example we are using, the hypothesis is that the median of 
quality and quantity differences between shifts is zero. Thus, if z;’ is the meas- 
ure of quality on the ith machine on the first shift and z;’’ is the measure on the 
second shift, then the difference z;=2,’’ —z;’ is viewed as an observation from 
some population for which the median is zero. Similarly, if we use y to measure 
quantity, the differences y;=y;:’’—y;’ have median zero. Generally, we have 
that the median of zx; cos +, sin @ is zero for any @. 

The artificial data of the first two lines of the table below may be used to 
illustrate the procedure. The first machine on the first shift showed a quantity 
value of 20.4 and a quality value of 15.7. The corresponding values for the 
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shift y;’ 








Second 2,’ . 33. . 1 
shift y;’’ 23. 42. : na.s. 24. 
Differ- 2; 22. 11. ; —3.5 12.7 
ences yi : 19. ; 26.4 -—1.3 











Ratios ys/z|  . 1.69 —.10 —7.54 —.102* 2.96 





*vi<0 


second shift are 10.7 and 35.5. Hence, the difference in quantity is 2,=2,'’ —2z,' 
=—9.7, while y,;=19.8 is the quality difference. Similar differences are com- 
puted for each machine, giving the fifth and sixth rows below. These are to be 
the basic data for the sign test, since the medians of these variates should be 
zero if there is no shift effect. 
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The direction of the vector to (z;, y;) from the origin (with respect to the 
positive x axis) is given if one knows the slope y;/z,; and the sign of y;. Any test 
based on the direction of the vectors can, therefore, be based on these quanti- 
ties. Our last line gives the slopes for each pair of observations, with an asterisk 
to indicate a negative value of yi. 

The probability that y; has an asterisk is one half, since under the null hy- 
pothesis the median of the conditional distribution on any line through the 
origin is zero, and fixing the ratio y;/z; simply fixes on a single line. Hence, if 
we look at the directions of the vectors with similar slopes, there should be no 
preferred direction since a particular vector and the one obtained by a rotation 
of 180° are equally likely. On the other hand, if the medians are different from 
zero, the vectors from the origin will have a preferred direction. 

We can arrange the vectors according to angle with the positive horizontal 
axis, beginning with the smallest and going to the largest angle. For our data, 
this yields .06, .07, .46, .83, .87, 1.69, 2.96, —7.54, —2.04, --1.92*, —1.44, 
—.78, —.61*, —.102*, —.10. 

We can standardize the problem by changing the slopes of adjacent vectors 
so that the angles the slopes represent will be 180/n degrees apart. This stand- 
ardization will give us a test which does not depend on the unit of measurement 
used or on the correlation between the two variates. 

The standardized vectors will tend to point in the same direction if the 
median is not zero and in a random pattern if it is zero. Put another way, if we 
take the intersection of the standardized vectors (prolonged if necessary) and 
the unit circle, then the center of gravity of these points should be near the 
origin if the null hypothesis is true and different from the origin otherwise. 

This center of gravity is obtained by computing the quantities 


L = 180j\° 
v,/n = — > a; cos ( *) 


1 jmd n 


l n—1 
v./n = — >> a; sin 
nm j=0 


—y 


n 


where a; is plus or minus one according to whether or not there is an asterisk at 
the (j+1)st ordered slope. In the above example 


v, = cos 0° + cos 12° + cos 24° + cos 36° + cos 48° + cos 60° + cos 72° 
+ cos 84° + cos 96° — cos 108° + cos 120° + cos 132° — cos 144° 
— cos 156° + cos 168° 
5.064 
sin 0° + sin 12° + sin 24° + sin 36° + sin 48° + sin 60° + sin 72° 
+ sin 84° + sin 96° — sin 108° + sin 120° + sin 132° — sin 144° 
— sin 156° + sin 168° 
5.622 
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In the next section it is shown that if the null hypothesis is true then », and v» 
are, approximately, independent normal variables with mean zero and variance 
n/2. Under the alternative, the means of one or both are different from zero. 
It follows that we can use the chi square tables with two degrees of freedom to 
test our hypothesis, rejecting if 


v? = 2(v,* + 027)/n (2.2) 


is greater than x2”. By way of illustration, if we use the five per cent level in our 
example, we should reject since 


v? = 2(5.06* + 5.627)/15 
= 8.0 > 5.99. 


3. THE THEORY OF THE TEST 


The basic data for the test can be taken to be n independent pairs of observa- 
tions (x1, ¥:), (2, Y2), - * * , (@n, Ya) from a bivariate probability distribution for 
which there is assumed to exist a unique point (yu, v) such that 


Pr {a(X — p) + B(Y — v) > O} = 1/2 (3.1) 


for all real a and b. This point will be called the median. (Implicit in the above 
definition is a continuity condition on the distribution at (yu, v). For ease of 
exposition, it will also be assumed that the bivariate distribution has a prob- 
ability density. There is little difficulty in modifying definitions and procedures 
to suit cases where this is not true.) Like other bivariate sign tests this one is 
based on u, Ue, + + * , U, Only, where u; is the angle which the vector from (yu, v) 
to (z;, y:) makes with the vector determined by some arbitrary vector, say the 
one from (0, 0) to (1, 0). 

With no loss in generality, we can concern ourselves with the test for the 
hypothesis that our median is zero. That is, choose between the hypotheses 


Ho: 22+ v2? =0 (3.2) 
and 
Ha: w+r?>0O. (3.3) 


It seems reasonable to consider only such sign tests as are invariant under 
H» with respect to the scale of measurement used and to rotation of the axes. 
In order to meet these requirements and to provide a reasonably simple test, 
proceed as follows: 


(1) For each i, place a plus sign at the intersection between the unit circle 
and the vector (z;, yi) (prolonged if necessary) and a minus sign at the 
intersection between the unit circle and (—zi, —y;). 

(2) Beginning with any sign and proceeding in a given direction on the circle, 
say counter-clockwise, move the next sign forward or backward until its 
distance along the circle from the first is the arc for 180/n degrees. 

(3) Proceed in this way until the signs are spaced an equal distance apart on 
the unit circle. 
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(4) Obtain the center of gravity of this system, attaching mass plus one to 
the points with positive signs and zero to the points with negative signs. 


The test is based on the distance of this point from (0, 0). When H, is true this 
distance is clearly invariant under rotations and changes in either scale of 
measurement. 

It is easy to obtain the limit distribution of our test statistic under Ho. Let 


wi , 
Vi = >a; cos C + A) 
n 


j=0 


n—1 ~ 
V2 = } a;sin ( + 4), 
j=0 n 

where the angle is measured from an arbitrary but fixed vector determined in 
advance and A is the angle from this vector to the first of the m points deter- 
mined in step (3). Under Ho, the a; are independent chance quantities which 
are plus one with probability one half and minus one with probability one half. 
The distance of the center of gravity in (4) from (0, 0) is 


V(Vi2 + V2?)/n. 


Under Ho, the first two moments of V; are 





E(V,) = E {00s (= + ay} E(a;) 
; n 


j=0 


0 since E(a;) = 0. 


n—1 1 2 
Var (V;i) = {eos (2 oo a) Var (aj). 
n 


j=0 


Now, Var (a;) =1 and 


a 
(cos md +A 


n 


Var (Vi) = 


The distribution of ./2/n V; may be approximated by the normal for suffi- 
ciently large n by virtue of the central limit theorem. That the theorem holds 
for our statistic follows from Liapounov’s condition [1] on the second and third 
absolute moments. For this problem this condition is that 
n—1 T ; \3 
lim n-*/2 pe E a; cos (= d = 0. (3.4) 


n= 2 j=0 n 
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This is clearly true because 


| ™) 13 
| a; cos (° a A) , Be 


j n | 


It will be shown that the limit distribution of V; and V;: is bivariate normal 
by proving that all linear combinations of V; and V3 are normal. This follows 
easily from the fact that we can write 


(3.5) 


n—1 / . 
= > } 008 G + A) cos (C — A) - sin(™ + A) sin (C — Ay} , 


n n 


That the left hand side has a normal limit follows from our discussion for Vj, 
since A was arbitrary. If we take (C—A) to be x/2, it now follows that V¢ is 
normal. Furthermore, if we take C—A so that b,/(b;?+-b,?)*=cos (C— A) then 
b,Vit+beV2 is, except for a multiplicative constant, of the form given in (3.5) 
and also has a normal limit, 

Since the variance of V,; and V; is the same for any A, the limit distribution 
of 2/n (V;, V2) is bivariate normal with mean zero, variance unity, and zero 
covariance. The distribution of the sum of squares of two standardized inde- 
pendent normal variables is a chi square with two degrees of freedom and it 
follows that 


V2 = 2(V,2 + V22)/n (3.6) 


has that distribution approximately. 

An indication of how good the above approximation to the actual distribution 
is can be obtained by comparing the fourth moment of ./2/n V; with the 
fourth moments of standardized normal and binomial variables. A computation 
very similar to that for Var (V;) yields (4/n?) E(V;*) =3—3/n, while the fourth 
moments for the normal and binomial (p= 4) are 3 and 3—2/n respectively. 
The fourth moments of the distribution of V2/n V, and of the binomial will 
agree when there are 50% more observations for the bivariate problem. This 
suggests that the approximation to the V? distribution which is proposed here 
will be satisfactory for sample sizes about 50% larger than those for which the 
binomial is satisfactorily approximated by the normal. 

The exact distribution for V* does not seem attainable in general except by 
enumeration of the 2* possibilities corresponding to +a;, +a2,---, +a, with 
some minor simplifications because of symmetry. An exception is the largest 
value of V? for each n whose probabilities are the same as those of the most 
extreme value of Hodges’ statistic, i.e., n2-"*'. It follows from this that the 
smallest sample sizes needed to get significance for nonrandomized tests at the 
.10, .05 and .01 levels are n=8, 9, and 12 respectively. 

The relative power of the V? test for large samples is now obtained for the 
case where the alternatives are bivariate normal with means different from 
zero. Since there is no real restriction involved, we will assume independence 
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between the two variates and unit variance for each, and following Pitman 
|4], we take the mean of (X, Y) to be (A/-/n, 0) where A>O. In order to 
simplify the argument we will introduce a pair of new chance variables asymp- 
totically equivalent to V; and Vs. 

Let 


W, = DicosU; and W2 = > sin Ui. 
tol tel 
Then, since 
l r 


/2 Tr 0 


A oe 
—~ ——= | cos’ udu 
V 2a 0 


Vx/2n 


cos? ye? +08" u) /2n du 


E cos U; 


2 


it follows that 


P( /= w,) a 3.6 
y} VY 1 2 T. (3.6) 


It is obvious that E(W2) =0. 

The argument used to obtain the distribution of V; and V2 under the null 
hypothesis can be applied here and shows that W, and W; are asymptotically 
independent normal variables with variance n/2. Hence for large but fixed n, 

72==2 (W°+ W;?)/n has the non-central chi square distribution with parameter 
of non-centricity 


mA? 


 reapic a 
° 4 


which can be compared with the corresponding value for Hotelling’s T? which 
is ¢? =A’. That is, when the alternative is bivariate normal Hotelling’s T? re- 
quires about 79% of the observations needed by this sign test to attain the 
same power. 

As was pointed out earlier, the Hodges test seems to be less efficient. At the 
five per cent level, for example, the minimum number for significance is about 
n/2—3+/n/4 for the range for which the distribution is known. If the search 
for the appropriate direction is at all efficient then the power will be approxi- 
mately that of a sign test with 3 in place of 1.96. 

But if the two variates are known to be independent, then a more powerful 
test can easily be constructed. Count the plus observations on one side of an 
arbitrary line. The difference of the count from n/2, divided by /n/4 is normal 
with zero mean and unit variance. Take a line at right angles, obtain the differ- 
ence in count from n/2 and divide by \/n/4. The sum of squares of these differ- 
ences is approximately a chi square variable and the five per cent critical value 
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is, therefore, about six. This test is based on differences of order +/n from n/2, 
as is the Hodges test. But a difference of about /6n from the origin is enough 
in this test as compared to about 3/n for the Hodges procedure. Put another 
way, the ratio of the number of observations needed to get the same power is 
about two to three. 

The requirement of independence for the test just constructed was not neces- 
sary. Had we worked with the standardized vectors there would have been 
asymptotic independence of the orthogonal quantities. The relative power for 
this substitute test compared to 7? is the same as that of the univariate sign 
test relative to Student’s t, namely 2/7. 
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In lifetesting, medical follow-up, and other fields the observation of 
the time of occurrence of the event of interest (called a death) may be 
prevented for some of the items of the sample by the previous occur- 
rence of some other event (called a loss). Losses may be either accidental 
or controlled, the latter resulting from a decision to terminate certain 
observations. In either case it is usually assumed in this paper that the 
lifetime (age at death) is independent of the potential loss time; in 
practice this assumption deserves careful scrutiny. Despite the resulting 
incompleteness of the data, it is desired to estimate the proportion P(t) 
of items in the population whose lifetimes would exceed ¢ (in the absence 
of such losses), without making any assumption about the form of the 
function P(t). The observation for each item of a suitable initial event, 
marking the beginning of its lifetime, is presupposed. 

For random samples of size N the product-limit (PL) estimate can 
be defined as follows: List and label the N observed lifetimes (whether 
to death or loss) in order of increasing magnitude, so that one has 
O0<t)' <b’ < +--+ <ty’. Then P(t) =[], [(N—r)/(N—r+1)], where r 
assumes those values for which ¢t,’<t and for which ¢,’ measures the 
time to death. This estimate is the distribution, unrestricted as to form, 
which maximizes the likelihood of the observations, 

Other estimates that are discussed are the actuarial estimates (which 
are also products, but with the number of factors usually reduced by 
grouping); and reduced-sample (RS) estimates, which require that 
losses not be accidental, so that the limits of observation (potential loss 
times) are known even for those items whose deaths are observed. When 
no losses occur at agez less than t, the estimate of P(t) in all cases re- 
duces to the usual binomial estimate, namely, the observed proportion 
of survivors. 
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1. INTRODUCTION 


1.1 Formulation. In many estimation problems it is inconvenient or im- 
possible to make complete measurements on all members of a random sample. 
For example, in medical follow-up studies to determine the distribution of sur- 
vival times after an operation, contact with some individuals will be lost before 
their death, and others will die from causes it is desired to exclude from con- 
sideration. Similarly, observation of the life of a vacuum tube may be ended by 
breakage of the tube, or a need to use the test facilities for other purposes. In 
both examples, incomplete observations may also result from a need to get out 
a report within a reasonable time. 

The type of estimate studied here can be briefly indicated as follows. When 
a random sample of N values, 7;, 72, - - - , Tw of a random variable is given, 
the sample distribution function F(t) is naturally defined as that which assigns 
a probability of 1/N to each of the given values, so that F(t) equals 1/N times 
the number of sample values less than the argument ¢. Besides describing the 
sample, this F() is also a nonparametric estimate of the population distribution, 
in the sense indicated in 1.2 below. When the observations are incomplete, the 
corresponding estimate is still a step-function with discontinuities at the ages 
of observed deaths, but it can no longer be obtained as a mere description of 
the sample. 

The samples considered in this paper are incomplete in the sense that one 
has given, not a random sample 7), - - - , T'w of values of the random variable 
T itself (called the lifetime), but the observed lifetimes 


t; = min (7T;, Li), #=1,2,---,N. (la) 


Here the L;, called limits of observation, are constants or values of other random 
variables, which are assumed to be independent of the 7; unless otherwise 
stated (in Sections 3.2 and 7). For each item it is known whether one has 


T; 3 li, t; = T; (a death) (1b) 


or 
Li< Ti, t= Ly (a loss). 


Ordinarily the 7’; and L; are so defined as to be necessarily nonnegative. 

The items in the sample are thus divided into two mutually exclusive classes, 
namely deaths and losses. A loss by definition always precludes the desired 
knowledge of 7;. On the other hand, a death does not always preclude the 
knowledge of the corresponding L,, in case the limits of observation are non- 
random and foreseeable. Such knowledge of the L; may have value; for example, 
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it makes available the reduced-sample estimate (Section 3), if one chooses to 
use it. 

The type of sample described is a generalization of the censored sample de- 
fined by Hald [17], and a specialization of the situation considered by Harris, 
Meier, and Tukey [18]. 

The term death has been adopted as being at least metaphorically appropri- 
ate in many applications, but it can represent any event susceptible of random 
sampling. In particular, the roles of death and (random) loss may be inter- 
changeable. By redefining the classification of events into deaths and losses, it 
may be possible to approach the same data from various points of view and thus 
to estimate the survivorship functions P(t) that would be appropriate to vari- 
ous categories of events in the absence of the others. This is familiar enough; 
see for example [6], [13], [16]. 

1.2 Nonparametric estimation. Most general methods of estimation, such as 
maximum likelihood or minimum chi-square, may be interpreted as procedures 
for selecting from an admissible class of distributions one which, in a specified 
sense, best fits the observations. To estimate a characteristic (or parameter) 
of the true distribution one uses the value that the characteristic has for this 
best fitting distribution function. It seems reasonable to call an estimation pro- 
cedure nonparametric when the class of admissible distributions from which the 
best-fitting one is to be chosen is the class of all distributions. (Wolfowitz [28] 
has used the term similarly in connection with the likelihood ratio in hypothesis 
testing). With a complete sample, it is easy to see that the sample distribution 
referred to in 1.1 is the nonparametric estimate on the maximum likelihood 
criterion. The same result is true of the product-limit estimate for incomplete 
samples, as will be demonstrated in Section 5. 

The most frequently used methods of parametric estimation for distributions 
of lifetimes are perhaps the fitting of a normal distribution to the observations 
or their logarithms by calculating the mean and variance, and fitting an ex- 
ponential distribution e~“/*dt/u by estimating the mean life » alone. Such as- 
sumptions about the form of the distribution are naturally advantageous insofar 
as they are correct; the estimates are simple and relatively efficient, and a 
complete distribution is obtained even though the observations may be re- 
stricted in range. However, nonparametric estimates have the important func- 
tions of suggesting or confirming such assumptions, and of supplying the esti- 
mate itself in case suitable parametric assumptions are not known. An impor- 
tant property of these nonparametric estimates is that if the age scale is trans- 
formed from ¢ to t* =f(t), where f is a strictly increasing function, then the cor- 
responding estimated distribution functions are simply related by F*(f(t)) 
= F(t). 

1.3 Examples of the RS and PL estimates. We will consider the following 
situation. A random sample of 100 items is put on test at the beginning of 1955; 
during the year 70 items die and 30 survive. At the end of the year, a larger 
sample is available and 1000 additional items are put on test. During 1956, 
15 items from the first sample and 750 from the second die, leaving 15 and 250 
survivors respectively. As of the end of 1956, it is desired to estimate the propor- 
tion P(2) of items in the population surviving for two years or more. 
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The survival probabilities are supposed to depend on the age (the duration 
of the test) rather than on the calendar year, and hence the data are arranged 
as in Table 460. 

















TABLE 460 
Samples I II 
Initial numbers 100 1000 
Deaths in first year of age _70 _750 
One-year survivors 30 250 
Deaths in second year of age 15 
Two-year survivors 15 








This particular example is such that it is easy to form an estimate P*(2) 
=15/100=0.15 from the first sample alone. This is called the reduced-sample 
(RS) estimate because it ignores the 1000 items tested only during 1956. It is a 
legitimate estimate only when the reduced sample is itself a random sample; 
this will be the case only when (as assumed here) the observation limits (two 
years for the first sample, and one year for the second) are known for all items, 
deaths as well as losses. In the absence of this information, one would have no 
basis for discriminating among the 835 deaths observed before the age of two 
years. One cannot simply ignore the 250 losses at age one year; since only 15 
items have survived for two years, P(2) would then be estimated as 15/850 
=.018, an absurd result. The point is discussed further in 3.1 below. 

We now inquire whether the second sample, under test for only one year, can 
throw any light on the estimate of P(2). Clearly it will be necessary to assume 
that both samples have been drawn from the same population, an assumption 
that the RS estimate P*(2) avoided. At any rate, the estimates of P(1) from 
the two samples, namely 0.30 and 0.25, are not sufficiently different to contra- 
dict the assumption. By combining the two samples, the estimate 


P(1) = P*(1) = (30 + 250)/(100 + 1000) = .255 


is obtained for P(1). (In this case the RS has the same value as the other esti- 
mate to be discussed, the product-limit or PL.) This result exhausts the useful- 
ness of the second sample for the present purposes; how does it help to esti- 
mate P(2)? 

The answer is that there are advantages to using the first (the smaller) 
sample for estimating P(2)/P(1), the conditional probability of survival for 
two years given survival for one year, rather than P(2) itself. This estimate is 


P(2)/P(1) = 15/30 = 0.50, whence 
P(2) = 0.255 X 0.50 = 0.127, 


a very simple example of the product-limit (PL) estimate. The outstanding 
advantage of this strategy is that it works just as well if we are not privileged 
to know that the 750 deaths in the second sample had observation limits of one 
year, because these items are irrelevant to the estimation of P(2)/P(1) in any 
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ease. Other considerations for deciding between the two estimates will be set 
forth in Section 3. 

The discussion of the PL estimate will be continued shortly, in Sections 2 
and 3. Section 3 is equally concerned with the RS estimate, while Section 4 is 
devoted to the actuarial estimates. The remaining three sections consist princi- 
pally of mathematical derivations. Though much older, the actuarial estimates 
are essentially approximations to the PL; they are products also, but typically 
they aim to reduce the number of factors by grouping. (Grouping may or may 
not be possible for the PL itself.) The distinguishing designation product-limit 
was adopted because this estimate is a limiting case of the actuarial estimates. 
It was proposed as early as 1912 by Béhmer [6] (referred to by Seal [26]), but 
seems to have been lost sight of by later writers and not further investigated. 

1.4 Notation. The survival function 


P(t) = Pr(T > 0), (le) 


giving the population probability of surviving beyond ¢, will be used in place 
of the distribution function F(t) =1— P(t) because of its convenience where the 
product-limit estimate and its actuarial approximations are concerned. In 
addition the following functions are defined: 


P(t) =product-limit (PL) estimate of P(t). 
P*(t) =reduced-sample (RS) estimate of P(¢). 
n(t)=the number of items observed and surviving at age t, when deaths 
(but not losses) at ¢ itself are subtracted off. 
N(t) =the expectation of n(t), for fixed observation limits. 
N°(t) =the number of items having observation limits LZ such that L>t. In 
practice this function is not necessarily known. 


For the first reading of the paper it may be desirable to suppose that the 
death of one item and the loss of the same or any other item never occur at the 
same age, and never coincide with an age ¢ at which any of the above functions 
are to be evaluated. This condition can always be met by fudging the ages a 
little when necessary. On the other hand, a regular user of the techniques will 
probably come to regard overt fudging as naive; he will prefer to formalize his 
notation and record-keeping by adopting the conventions already insinuated 
into the definitions of death and loss, P(#), and n(t) above. These conventions 
may be paraphrased by saying that deaths recorded as of an age ¢ are treated as 
if they occurred slightly before t, and losses recorded as of an age ¢ are treated 
as occurring slightly after ¢. In this way the fudging is kept conceptual, sys- 
tematic, and automatic. 

The convention that deaths precede losses in case of ambiguity is based on 
the following sequence of operations, which is clearly more efficient than the 
reverse sequence: Examine a group of items of age to, observe the number 6 of 
deaths since the last examination, and then remove (or lose contact with) a 
number A of the survivors. It may then be convenient simply to record to as 
the age of death or loss of the 5+ items, especially if t) is always an integral 
multiple of a fundamental time interval; in fact, however, the deaths will have 
preceded the losses. 
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The chief exception to the immediate applicability of this convention occurs 
when the losses are random but cannot affect items that have already died. 
Then the possible sequences of occurrence of deaths and losses between exami- 
nation times (assumed to be close together) are approximately equally likely, 
and a reasonable compromise is to assume that half the losses in the interval 
precede and half follow the deaths, as in (4b) below. If the loss of an item is 
compatible with the possibility of its having died (unknown to the experi- 
menter) between the time when it was last examined and the time of loss, an 
item lost in this way is effectively lost just after it was last examined, and the 
convention is entirely appropriate. The disappearance of individuals subject 
to medical follow-up is a case in point. 

The remainder of the conventions concern the treatment of discontinuities in 
the functions listed above. Should the value assigned to n(é), say, for an argu- 
ment ¢ for which it is discontinuous be the right-hand limit n(¢+0) =lim n(t+h) 
as h-0 with A>0, the left-hand limit n(t—0) (the same but with h<0), or 
something else? The superior expressiveness of the notation adopted is illus- 
trated by the relations 


n(t — 0) — n(t) = the number of deaths at ¢, 
n(t) — n(¢t + 0) = the number of losses at ?, 


or equivalently by the formula (3a): P*(t) =n(t)/N°(t), which otherwise would 
not be valid at discontinuities. Analogous conventions are adopted for all the 
above-mentioned functions of t, so that P(t) and P(t) are right-continuous, 
N°(t) is left-continuous, and n(¢) and P*(t) are neither. Other advantages of the 
convention for P(t) and its estimates are the following: (a) According to the 
sequence of operations assumed above, one may record deaths as of age ¢ al- 
though they actually occurred slightly earlier. (b) It makes P(0) =1 if and only 
if no item dies at birth (age zero). This is convenient and natural. 

One other possibility may be mentioned briefly. The assumption that 
P(«)=0, so that the lifetimes are finite with probability one, is necessary only 
for parts of Section 7 und for the calculation of a finite mean lifetime. However, 
in practice there is nc apparent need to contradict the assumption either. If 
half of a sample dies in one day and the other half is still alive after 1000 days, 
one should still report P(1000) (not P(«))=0.50, since the argument 1000 is 
not an arbitrary large number, but the actual duration of the test. 


(1d) 


2. THE PRODUCT-LIMIT ESTIMATE 


2.1 Definition and calculation. Both the PL and the actuarial estimates of 
Section 4 are based on the following general procedure: 


(a) The age scale is divided into suitaDY; chosen intervals, (0. u;), (v1, we), 
- +, as descrived below. (In the exantle of 1.3, there were only two 

such intervals, namely (0, 1) and (1, 2).) 

(b) For each interval (u;1, u;), one estimates p; = P;/P;1, the proportion of 
items alive just » ‘ter u;. that survive beyond uy. 

(c) If tis a division point (it may be introduced specially if necessary), the 
proportion P(t) in the population surviving beyond t is estimated by the 
product of the esti:r.ated p; for all intervals prior to ¢. 
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If step (a) is left relatively arbitrary and approximations or parametrié 
assumptions are accepted in step (b), one arrives at the actuarial estimates. 
The PL is obtained by selecting the intervals in (a) so that the estimation in (b) 
is a simple binomial, without any recourse to assumptions of functional form. 
The condition for this is that within each interval, deaths and losses be segre- 
gated in a known fashion. As a beginning, it may be assumed that no interval 
contains both deaths and losses. Then if the number under observation just 


after uj: is denoted by n;, and 6; deaths are observed in the interval (u;_1, u;), 
the estimate is clearly 


pb; = (nj; a 5;) /n; = n,'/n;, (2a) 


where n,;’ is the number under observation just after the 5; deaths. However, if 
the interval contains only losses (but at least one item survives throughout the 
interval), the estimate is £;=1. 

In the product of conditional probabilities formed in step (c), unit factors 
may as well be suppressed; and we need not be concerned with the manner in 
which the losses are distributed among the intervals, so long as n; and n,’ are 
correctly evaluated in (2a), and no losses occur at ages intermediate to the 4; 
deaths. The situation is illustrated by the following scheme: 














TABLE 463 
No. of items | N ni n,' Ne Ne 
No. of deaths or losses do 5 wy 5. e 
Division points ug =0 uy U2 








Here N is the initial number of items, and the braces join the numbers whose 
ratios are the conditional probabilities (2a). The numbers in the second line 
are the differences of those in the first, the \’s counting losses and the 4’s deaths; 
some of these could be zero. The division points u; are placed in the third line 
to show that the 6; deaths occur between u;.. and u;, while u; is located any- 
where among the A; losses. The relation n,’ <n(u;) <nj4: holds. 

The PL estimate is now given by 


P(t) = [J (n/n), with wu = it, nj = n; — 5). (2b) 


j=l 
If the greatest observed lifetime ¢* corresponds to a loss, (2b) should not be 
used with t>?*; in this case P(t) can be regarded as lying between 0 and P(t*), 
but is not more closely defined. 

If it is desired to permit the entrance of items into the sample after the com- 
mencement of their litetimes, this can be done by treating such entrances as 
“losses” that are counted negatively in \;. The same items can of course disap- 
pear again at a later age and so yield ordinary losses as well. It is assumed that 
nothing is known of the existence of any such item that dies before it becomes 
available for observation; that is, the observation is censored on the right but 
truncated on the left, in the terminology of Hald [17]. 
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The form (2b) was selected for the PL estimate to give the minimum number 
of elementary factors and the maximum grouping of the observations. Never- 
theless, the number of deaths 4; in an interval can easily be as small as unity. 
The resulting estimate /;, though of limited value by itself, is none the less 
acceptable as a component of P(é). In fact, one is at liberty to take the intervals 
as short and as numerous as one pleases, and to regard each death as occupying 
an interval by itself. To specify the resulting expression, one relabels the N ages 
t; of death or loss in order of increasing magnitude, and denotes them by 
ti’ Sh’ < +--+ Sty’. Then 


Pt) = TI [(V - )/(V - r+))], (2c) 


where r runs through those positive integers for which ¢,’<¢ and ¢,’ is the age 
of death (not loss). The cancellation of like integers in numerator and denomi- 
nator where they occur reduces (2c) to (2b). If there are no losses, everything 
cancels except the first denominator N and the last numerator n(t), say, and 
the PL reduces to the usual binomial estimate n(t)/N. (2c) shows that A(t) is a 
step-function which changes its value only at the observed ages of death, where 
it is discontinuous. 

In analyzing data on lifetimes by the multiplication of conditional probabili- 
ties, one of the following three procedures will usually suffice: 

(1) If the number of deaths is relatively small, these deaths may be arranged 
in order of age without grouping, and the numbers of losses in the intervening 
age intervals counted. The PL estimate is calculated by (2c). 

(2) If (1) is too time-consuming but the number of distinct ages of loss is 
relatively small, these ages may be arranged in order, additional division points 
inserted as desired, and the numbers of deaths in the resulting age intervals 
counted. If some of these intervals are shorter than necessary and are found to 
contain no deaths, they can be combined with adjacent intervals. The PL esti- 
mate is calculated by (2b). 

(3) If neither (1) nor (2) is compact enough, then division points are chosen 
without close consideration of the sample, deaths and losses are counted in 
each interval, and an actuarial approximation to the PI., such as (4b), is used. 

As a miniature example of case (1), suppose that out of a sample of 8 items 
the following are observed: 


Deaths at 0.8, 3.1, 5.4, 9.2 months. 
Losses at 1.0, 2.7, 7.0, 12.1 months. 


The construction of the function P(t) then proceeds as follows: 

















TABLE 464 
uj | n, | n;" Aj P(u;) 
0.8 | Ss 7 2 7/8 
3.1 5 4 0 7/10 
5.4 4 3 l 21/40 
9.2 2 1 0 21/80 
(12.1) | 1 1 1 21/80 
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Each value of P(u;) is obtained by multiplying n,;’/n; by the preceding value 
P(uj1). The age 12.1 is recorded in the last line to show the point at which 
P(t) becomes undefined; since it is a loss time, the 12.1 is enclosed in parenthe- 
ses. It is to be inferred from the table that P(5.3)=7/10, for example. The 
third and fourth columns could be omitted since n,;’=n,;—1 (except in the last 
line, which corresponds to a loss) and A; =n,’ —j41. 

A rudimentary illustration of case (2) has already been given in 1.3. A little 
more elaborate example of a similar sort with N = 100 is given in Table 465. 
Here 1.7, 3.6, and 5.0 are assumed to be the only ages at which losses occur; 
they are prescribed as division points. The other division points (1, 2, 3, 4) are 
selected at pleasure, with the object of interpolating additional points on the 
curve of P(t) vs. t. Ng; is the effective sample size defined in (2j) below. In 
practice four columns headed u,, n;, n,’, P; will suffice. From the table one 
infers that .74< P(2.5) <.87, for example. 























TABLE 465 

Interval Factor 

Uj-1, Uj y] nj 5; rj Pi P(u;) Nz; 
0 -l1. 1 100 3 0 97/100 .97 100 
1 -1.7 2 97 5 20 92/97 92 100 
1.7-2 3 72 4 0 68/72 .87 88 
2 =3 4 68 10 0 58/68 .74 83 
3 -3.6 5 58 9 12 49/58 .63 80 
3.6-4 6 37 6 0 31/37 .52 73 
4 -5.0 7 31 15 16 16/31 a ° f 51 





2.2 Mean and variance of P(t). The important facts here, derived in Section 
6.1, are that P(2) is consistent and of negligible bias (unless excessive averaging 
is done; see Section 3.1), and that an asymptotic expression for its variance can 
be obtained. Like the estimate itself, the sample approximation to its variance 
proves to be independent of the limits of observation of items not actually lost. 
However, the variance derived from population values does depend on all the 
limits of observation, which are assumed to be fixed during the sampling. 

It has been noted that if the greatest observed lifetime * corresponds to a 
loss, then for ¢>¢*, P(t) is undefined though bounded by 0 and P(é*). Unless 
the probability of this ambiguous situation is quite small, however, a non- 
parametric estimate of P(t) will not be very informative in any case. The 
ambiguity cannot occur unless the N°(t) items observable to ¢ all die at ages 
less than ¢. The probability of this event is 


[1 a: P(t) |r < e-N*WOPWH) = e-N(t) (2d) 


This is already less than 0.01 when N(é) is only five. 

It is shown in Section 6.1 that if one can supplement the ambiguous case by 
ascertaining the age of death of the item lost at ¢*, or of one or more other 
randomly selected items alive at ¢*, and defines P(t) for t>t* as P(t*) times the 
survival function for the supplementary sample, then the expected value of 
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P(t) is precisely the population value P(t). In practice this supplementation 
would often be neither feasible nor worthwhile, but with otherwise adequate 
data the resulting bias in one or a few samples will be too small to have any 
practical importance. 

It will be shown in Section 6.1 that the variance of P(t) is given approxi- 
mately by 


V3 
V (Pa) ) = P*(t) > (qi/N spi), (2e) 
1 


where the distinct limits of observation L,’ are now used as the division points; 
L’,-1 is the greatest preceding ¢; and pj=1—q;=P(L,;’)/P(L';), with Lo’ =0, 
L,’ =t. After dividing (2e) by P(t), one sees that the square of the coefficient of 
variation (CV) of P(t) is set equal to the sum of the squares of the CV’s of the 
estimates of the p;, the usual approximation for the variance of a product. 

If the sample estimates are inserted in (2e) one obtains 


‘ - ee aT a 1 
P (PO) = Pw) YX [8;/nj(n; — 4) ] = PO X(— -—), (2f) 


3 I 


A very similar formula was derived by Greenwood [15] and later by Irwin [19] 
in connection with actuarial estimates. It is easily verified that (2f) remains 
valid when the number of intervals is reduced to those used in (2b), or ex- 
panded to one interval for each death as in (2c). In the latter case it may be 
written 


V[P) = Pa) dS [(N -—n(N- rt dD), (2g) 


where r runs through the positive integers for which ¢,’ <¢ and t,’ corresponds 
to a death. 
In terms of integrals (2e) can be written 


* | dP(u) | 
0 N°(u) P?(u) 


a f LaP@| 
= Po f as 
0 N(u)P(u) 


V [Pw] = P*@) 
(2h) 





In case of ambiguity the first form should be referred to and interpreted in ac- 
cordance with 1.4. If P(u) alone is discontinuous within the range of integration, 
it should be regarded as a (continuous) independent variable, so that 
JP(u)-*| dP(u) | =1/P(u). 

Since N°(t)<N°(u) <N, it is clear that (2h) is greater than the complete 
sample variance P(t)[1—P(é)]/N but less than the reduced-sample variance 
P(t) (1—P(#)]/N°(@) of (3b). Lf losses are random and the instantaneous rates of 
death and loss among survivors are in the ratio 1 to p at all ages, then one has 
E[N°(u)~1] = [NPe(u) ]-* for large samples, and (2h) reduces to 


V [Pm] = [P9(t) — P2()]/C1 + p)N (2i) 
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It is often instructive to estimate an effective sample size N z(t), which in the 
absence of losses would give the same variance V[P(t)]. Evidently 


Ne(t) = Pw [1 — P)/P (FO). (2)) 
It can be shown that V(t) is nonincreasing as ¢ increases, and that 
n(t)/P(t) < Ne) < D®/[1 — Pe], (2k) 


where D(t) is the number of deaths observed at ages not exceeding ¢. The 
upper bound was proposed as an approximation to Ng(t) by Cornfield [8]. The 
lower bound corresponds to the reduced-sample variance. 

In Table 464 of 2.1 we had P(6) =21/40=0.525. By (2f) the variance of P(6) 
is estimated as 


l 2 
V[P(6)] = (0.525)*( + —— ) = 0.042. 
8x7 5x3 


By (2g) the same result appears in the form 





1 
0.525)? + +). ° 
' ase, 4X5 3X4 


The effective sample size is estimated as 
Ne(6) = (0.525) (0.475) /0.042 = 6.0. 


The bounds for N'2(6) in (2k) are 5.7 and 6.3. Values of Ng are also indicated in 
Table 465. 

2.3 Mean lifetime. The PL estimate g of the mean lifetime i is defined as 
the mean of the PL estimate of the distribution. It is well-known (and easily 
proved by integrating by parts) that the mean of a nonnegative random varia- 
ble is equal to the area under the corresponding survivorship function. Hence 


a -f P(t)dt. 
< 0 


Of course, if P(t) is not everywhere determined, 2 is undefined. In cases where 
the probability of an indeterminate result is small, P(t) is practically unbiased 
and the same is true of Z. 

If we “complete” P(t) in the example of Table 464 by following the longest 
observed individual (with t,’=12.1) to death at t=14.3 we have 

uw = (1.000)(0.8) + (0.875) (3.1 — 0.8) + (0.700)(5.4 — 3.1) 
+ (0.525) (9.2 — 5.4) + (0.2625)(14.3 — 9.2) 
= 0.800 + 2.012 + 1.610 + 1.995 + 1.339 = 7.76. 


If P(t) were “completed” by setting P(t) =0 for the indeterminate range, the 
last term in the above sum would be replaced by (0.2625) (12.1-9.2) =0.761, 
and jj would be estimated as 7.18. 

Of course, if the probability of an indeterminate result is high, there is no 
satisfactory way to estimate yu. In such cases Irwin [19] has suggested that in 
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place of estimating the mean itself, one should estimate the “mean life limited 
to a time L,” say yz). This is the mean of min (7;, L), with L chosen at the in- 
vestigator’s convenience. Naturally, one would choose L to make the probability 
of an indeterminate result quite small. If one chooses to use this procedure he 
should give an estimate of P(L) along with jz). If we take L=10 in our 
example, 


Buoy = 0.800 + 2.012 + 1.610 + 1.995 + (0.2625)(10 — 9.2) = 6.63, 


and P(10) =0.2625. 
In Section 6.2 an approximate formula is given for the variance of i: 


ss = A(t) | dP()| = A*(t)| dP(o) | ; 
} = —— = es 21 
\) f N()P() rf N°(t)P2(t) 2!) 


where A(t) = /* P(u)du. Upon making the obvious substitutions we find, after 
some reduction, the following estimate of V (a): 


A? 
P@) = 2X (N —r)(N —r +1) 


where r runs over those integers for which ¢, corresponds to a death, and 
A,=J, P(u)du. If there are no losses, 





(2m) 


N 
A, = >> (t; — t,)/N, 


i=r+l1 


and it can be shown that V(z) reduces to : (t;—#)2/N*. This fact, plus the 
impossibility of estimating the variance on the basis of only one observed 
death, suggests that (2m) might be improved by multiplication by D/(D—1), 
where D is the number of deaths observed. 

For our first estimate of 7 above we have 


A, = 2.012 + 1.610 + 1.995 + 1.339 = 6.956, 
A, = 1.610 + 1.995 + 1.339 = 4.944, 
As 1.995 + 1.339 = 3.334, 
Az 1.339 = 1.339. 
(Obviously the A, and i are best calculated in reverse order.) We then have for 
the estimated variance, 
“ 6.956)? 4.944)? 3.334)? 1.339)? 
Pq) — (6-956) , (4.944)? | @.334)* | (1.339)? 
7X8 4X5 3x4 1X2 
The estimated standard deviation of i is +/3.91 = 1.98. If the factor D/(D—1) 
=4/3 is included, these results become 5.21 and 2.28 respectively. 
If one is limited to grouped data as in Table 465 it is necessary to use 


actuarial-type assumptions ‘e.g., the trapezoidal rule) to estimate 7 and its 
variance. Thus, we may estimate the mean life limited to 5 time units as follows: 


fis) = (1/2)[(1.00 + 9.97)(1.0 — 0.0) + (0.97 + 0.92)(1.7 — 1.0) 
+-+++-+ (0.52 + 0.27)(5.0 — 4.0)] = 3.76. 
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We observe in conjunction with this estimate that the estimated proportion 
surviving the limit is P(5) =0.27. 


3. THE REDUCED-SAMPLE ESTIMATE Vs. THE PL 


3.1 Alternatives to the PL estimate. The acceptable nonparametric estimation 
procedures known to the writers in the situation considered are variants of the 
PL and of the reduced-sample (RS) estimate defined by 


P*(t) = n(t)/N°(t) = n(t)/[n(t) + D°(O)]. (3a) 


Here N°(t) =n(t)+ Dt) is the number of items with observation limits >t; of 
these, D°(t) die at ages <t¢ and n(¢) survive beyond ¢. A simple example of the 
RS estimate has been given in 1.3. Berkson and Gage [3] have called it the 
“ad-hoc” method. 

If the lifetimes are independent of the observation limits, the reduced sample 
of N°(t) items will be a random sample and P*(t) a simple binomial estimate, 
unbiased, and with the variance 


V([P*()] = Pw®[1 — P®)/N°. (3b) 


It has already been noted that this is equal to or larger than the approximation 
(2h) to the PL variance. As with the PL, the estimate 


= f “P* (dt (3c) 


of the mean life may have to be truncated at the greatest of the observation 
limits. In Section 6.2 it is shown that the variance of u* is given exactly by 
e1- 1— P® 


V(ut) = 2 . a “P(u)dudt, 


In practice one substitutes P*(t) for P(t) in this formula and perhaps approxi- 
mates the integrals. 

Whereas the PL estimate has discontinuities only at observed ages of death, 
it is apparent from (3a) that the RS generally has discontinuities at losses and 
observation limits also. Furthermore, suppose that one of the observation 
limits L is such that all (or a sufficient number of) the corresponding items are 
observed to die prior to L, while other items survive and are observed beyond L. 
Then the decrease in N°(t) (without an equivalent decrease in n(t)) will cause 
P*(t) to increase as t increases through L. Thus P*(t), unlike P(t) and the true 
P(t), is not necessarily monotonic decreasing. However, this appears to be a 
disadvantage only in a psychological sense. It does not seem advisable to avoid 
it, as one could, by a further reduction of the sample, basing the estimated 
P(t) for all t<a suitable Z on the fixed sample of items having observation 
limits> L. 

The expression D°(t)=N°(t)—n(t) in (3a) is not the total number D(t) of 
deaths observed prior to or at age t, but the (usually smaller) number of such 
deaths having observation limits>t. Since P*(t) is unbiased, the estimate ob- 
obtained by replacing D°(t) by D(t) is too small on the average (except in 
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special cases where D(t) = D°(t) with probability one), as has been pointed out 
many times (e.g., [3] and [23]). Examples such as that of 1.3 show that the 
resulting bias can be very great, being in fact limited only by the necessity 
that the expectation be positive. Since D°(¢) is often unknown while D(t) is 
known, an investigator who is unacquainted with the PL or one of its approxi- 
mations is all too likely to fall into the trap. 

Another insidious but instructive characteristic of the illegitimate estimate 
based on D(t) is the fact that it is simply the average of the PL estimates P(t) 
(those that are defined at ¢) obtained by regarding each item as a sample in its 
own right. For a sample of one item with observed lifetime 4, P(é) is unity for 
t<t, and for ¢>t is zero or undefined according as the observation ends with 
death or loss. This is also the RS estimate unless one insists that P*(¢) is unde- 
fined for t>Z, (the observation limit of the item), even though it may have 
been estimated as zero for smaller values of t. 

The moral of this discussion is that arithmetic averages of independent PL 
estimates from very small samples will tend to be biased. On the other hand, 
the averaging of independent RS estimates is quite satisfactory, especially if 
the values of N°(t) are used as weights. To avoid bias one must at least assign 
zero weight when N°(t)=0, thus supplying each estimate with a limit of ob- 
servation (thc maximum of those attached to its items) beyond which it is 
regarded as undefined, even though a value of zero may be indicated at some 
earlier age. 

To summarize, the advantages of the PL over the RS are the following: 

(a) The observation limits need not be known for items observed to die. 

(b) The sampling variance of the PL is usually a little smaller. 

(c) The PL has fewer discontinuities, and its monotonicity may be comfort- 

ing. 


On the other hand, the RS has the following advantages over the PL: 


(a) It is perfectly unbiased, and hence estimates from many small samples 
can be combined by (weighted) averaging. 

(b) It facilitates the estimation of an isolated value of P(t). 

(ec) It may be preferred in some cases of dependence between lifetimes and 
observation limits (see 3.2 below). 


3.2 Dependence of deaths and losses. The assumption that the full lifetimes 
7; are independent of the observation limits L; is sometimes violated, as a re- 
sult of a change in the population sampled, or the conditions leading to the 
event called death, or the method of sampling the population. For example, in 
a study of survival after an operation, a change in surgical technique five years 
before the data are analyzed will affect the survival times only of those with 
observation limit less than five years. When loss is due to unforeseen circum- 
stances, such as patients moving out of the state, the possibilities for depend- 
ence are obvious. Merrell [23] and Sartwell [24] have emphasized this point. 

The estimate of P(t) from the sample as a whole, whether PL or RS, involves 
arbitrary assumptions whose danger is peculiar to the dependent case and hence 
easily overlooked. The PL estimate in effect assumes that for items having an 
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observation limit L less than t, the conditional survival probability P(t)/P(L) 
is the same as that for items whose observation limits exceed t, while the RS 
estimate makes the same assumption concerning the absolute survival prob- 
ability P(t) itself. In many applications, the PL assumption may seem as 
plausible as any; on the other hand, the fact that the results are expressed 
in terms of absolute probabilities may lead one to prefer the RS assumption 
(when there is no feeling to the contrary), if only because its operation is more 
easily visualized. For example, the RS estimate P*(t) (apart from sampling 
fluctuations) can never fall outside the range of the true values for the popula- 
tions sampled, because it is a weighted average, whereas in unfavorable cases 
the PL estimate P(t) may fall outside this range. 

If prior observation limits are given for all the items and the sample is 
large enough, dependence may be inquired into by grouping the items according 
to these limits, making separate estimates of P(t) for each group, and comparing 
the results. If some unanticipated losses occur in advance of the a priori limits, 
the PL method can be used within each group. The group estimates can then 
be averaged with the initial numbers of items as weights to give something 
similar to the RS estimate, if that is desired. 

On the other hand, no dependence can be demonstrated if one’s information 
is limited to the values of the t;=min (Z,, T;) and their classification as deaths 
or losses; in this case the observed rates of death and loss can always be repre- 
sented by a model in which deaths and losses occur independently, as well as 
by many models in which this is not so. However, one usually has or can obtain 
other information that is more or less useful. 

What one would like is to obtain for each item the value of an auxiliary 
variable V (which may be either quantitative or qualitative), such that the 7; 
and L,; are more nearly independent within subsamples defined in terms of V, 
than they are in the sample or population as a whole. This will generally be 
true if V is strongly related to the cause of loss, and independent of the cause of 
death, or vice versa. In this case the subsamples are formed so as to reduce the 
variability of the L, (or the 7;) within them. Even if the auxiliary variable does 
not have the desired properties, it may be worthwhile to get an estimate of 
P(t) by a different route for the sake of comparison. Also, the auxiliary variable 
may be of interest for its own sake, for example, classification of the items by 
starting date would be aimed at detecting a temporal change in the population. 
As pointed out in 3.1, the procedure in the general case is not satisfactory if the 
estimates depend on very few items. 

The innocuous form of dependence that results from testing with replace- 
ment is considered briefly in Section 7. 


4. ACTUARIAL ESTIMATES 


4.1 Estimates using only n, 6, X. It has already been indicated (in 2.1) that 
actuarial estimates, like the PL, are formed by multiplying together a sequence 
of estimates of conditional probabilities of survival through intervals (0, u,), 
(wu, ue), - - +. Unlike the PL, the actuarial estimates will generally be some- 
what dependent on the selection of these intervals. It remains to consider how a 
typical factor p of this sort can be estimated when the number of items n at the 
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beginning of the interval is known to be depleted by 6 deaths and J losses 
within the interval, but the order in which these occur is not known or used. 
The PL estimate is denoted by #. 
As in 2.1, the estimates 
n—A—s (4a) 
-:_———— a 
P n-—-dX 
would be used if all the deaths were known to precede all the losses, and if all 
losses preceded all deaths. ~»spectively. Evidently p< p<@. If p is not known, 
another intermediate val ie is clearly supplied by the well-known “adjusted- 
observed” estimate (cf. [%], [12], [18], [22]) 


n—/2—4 


4b 
n — d/2 (4b) 


a) on 
pe = 


This can be recommended for its simplicity even though 


n—}r/2—6 n—6/2—-rA n-—-b6—X 
sentpcieaereatinen. # (4c) 


n—>/2 n—6/2 n 





Here the second fraction is the corresponding estimate of the probability of not 
being lost (if losses are also random events), and the third fraction is the ob- 
served probability of escaping both death and loss. The only other esiimate in 
this section that fails to give equality here is p“ below. 


The “joint risk” estimate (ef. [6] and [12]) is 


n — 6 — X7 4/4) 
p® = [* =] (4d) 


n 


It is the maximum likelihood estimate when the losses are random, the instan- 
taneous event-rate for losses is a constant times that for deaths, and the only 
data given are the values of n, 6, and \. The above estimates satisfy the rela- 
tion p<p <p™ <p. 

One can obtain the PL estimate # from any of the foregoing by dividing the 
given interval into smaller i:tervals no one of which contains both deaths and 
losses. Since there are reasons (full use of information, absence of arbitrary 
assumptions, essential uniqueness of the estimate) for preferring small intervals 
to large, it seems reasonable to regard # as the standard to which the large- 
intervai actuarial methods are approximations. 

The error incurred in an actuarial estimate c* p can be attributed to two 
sources, namely, the sampling error of , reflecting the favt that 6 is a random 
variable, and the discrepancy between p“ and §, reflecting the unknown ar- 
rangement of deaths and losses within the interval. These sources of error will 
be compared by calculating the varianve of log / in each case, on the assumption 
that n is large compared with 6 and X, and that factors of the form 1+0(n-*) 
can be neglected. On this basis the effect of variation in 6 is indicated sufficiently 
by the square of the coefficient of variation of p“, namely 


5/(n — d/2)(n — 6 — d/2) = 46(2n — 6 — A). (4e) 
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To evaluate the second source of error we note first that the total range of 
possible variation in log / (given 6 and \) is by (4a) 


log p — log p = 45d(2n — 5 — d)-*. (4f) 


However, if the arrangement is random, the extreme values may be unlikely. 
Therefore it will now be assumed that all permutations of the 6 deaths and \ 
losses are equally likely (which happens to be a consequence of the proportional 
event-rates leading to (4d)). In this case log # is minus the sum of a random 
sample of 5 of the 6+ numbers 


1 1 1 
log (1 > —), log (1 + —), ts log (1 {- -——____— ) é 
n—1 n—2 n—6éd—XWX, 


Sampling theory and the Euler-Maclaurin theorem lead to the approximate 
variance 


46d(5 + X + 1)/3(2n — 5 — X)4. (4g) 


Dividing the sampling variance (4e) inte the square of half the range (4f) of 
the grouping error gives 


5x2(2n — 8 — d)-? (4h) 


which may be either >1 or <1. Dividing (4e) into the grouping variance (4g) 


gives 
(6 + A + 1)/3(2n — 6 — X)?, (4i) 


which is always <1 (when n>0). 

The above results suggest that rather large intervals could be used if enough 
was known about the mechanisms of death and loss, and that the principal 
source of error is the probable failure of the various permutations of deaths and 
losses within the interval] to be equally likely, or to fit any other scheme that 
may be assumed. Evidence on this point could be obtained by examining the 
permutations that actually occur. For example, this is what is done in using 
(2a). Otherwise one should probably require at the least that (4h) be suitably 
small. This still assumes that grouping errors, like the sampling errors, are inde- 
pendent from interval to interval. If not, one is thrown back on the ultimate in 
conservatism, which is to assert only that ILa< P<] a; 

4.2 Estimates using average ages. Several estimates will now be considered 
that make use of the individual ages of death and loss, although arbitrary age 
intervals continue to be used. The individual ages enter via the average age A of 
the 5 deaths, and the average age A of the A losses, both measured from the be- 
ginning of the age interval under consideration, which is of length h. 

The first estimate assumes nothing about the losses, but assumes that the 
instantaneous death-rate is constant throughout the interval. The total ex- 
posure to the risk of death in the interval is then (n-5-\)h+6A+AA, the maxi- 
mum-likelihood estimate of the death-rate is 6/[(n-5-A)h+6A4+ AA], and the 
corresponding estimate of the survival probability p is 


p® = exp (—6/[n — 6 — » + (6A + AA)/h)). (4j) 
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By using p™, Harris, Meier, and Tukey [18] have been able to give an estima- 
tion procedure for the more general and more difficult case in which the age of 
death is never known exactly, but only known to fall in some interval, semi- 
infinite or finite, bounded by one or two observation points which vary from 
item to item. 

The PL estimate / can be derived from p® as well as from p“ or p™. Al- 
though the limit of the product of the estimates p® for the subintervals is not 
quite unique, the average value of log p(*) corresponding to 6 deaths occurring 


at the same age is 
: du n—6 
+f —————_—--—-—- = log —a 9 (4k) 
9 n — O(1 — u) n 


in agreement with p=(n—4)/n. Here A(=wh) has been regarded as uniformly 
distributed between 0 and h. However, in other respects the behavior of p® 
does not parallel that of p. If the position of one or more of the deaths is shifted 
toward the end of the interval, p™ is increased, while / is decreased (or else not 
changed). Again, p® does not necessarily agree with / in the simple case of no 
losses. Finally, the extreme values of p®, obtained by putting A=A=0 or A, 
fall outside the interval D, p within which # is confined; in fact one can have 
p™ <p while p=), or p™ > while p=p. 

The following estimates have been constructed with the object of using the 
mean ages A and A to simulate the behavior of p: 


p™ = (n — cd — 8)/(n — cd), (41) 


n-r— 6 Tn—5]}° 
gm a : (4m) 
tL nm—AX n 





>= 1/2+(4—A)/h for |A—A] <h/2, (4n) 
_ for A—-AS —Ah/2, 
=: ] for A-—-A>h/2. 


The value of c is immaterial when 6=0 or A=0. 

These estimates are two different averages of the bounding estimates p and 
p, taken with weights c and 1—c respectively. The quantity c is so defined that 
the appropriate one of the bounding estimates is assigned as soon as |A—A| 
>h/2. This is intuitively reasonable, and is justified analytically by the fact 
that log p and log p™ have essentially the variance (4g) of log , if each death 
and each loss is uniformly and independently distributed in the interval. To 
estimate this variance, one can multiply 


var c = var [(A — A)/h| = (6 + d)/126d 


by the square of the range (4f), since (4f) gives the change in log p(‘) and log 
p® corresponding to a unit change in c. 

Experience may indicate which estimate is preferable in a given type of 
situation. If relevant experience is lacking, it would seem advisable to rely 
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primarily on adequately small intervals, and to minimize the computation 
time by using simple estimates. This suggests that p“ be used when the deaths 
and losses are thought to be arranged at random within the interval, # when 
the interval is chosen so that most of the deaths precede most of the losses, and 
p when losses precede deaths. 

This completes the material that seems to be most important for applica- 
tions. The remaining three sections are devoted primarily to mathematical 
derivations. 


5. MAXIMUM LIKELIHOOD DERIVATION OF THE PL 


In accordance with 1.2 the method of maximum likelihood is viewed as a 
means for selecting the best fitting distribution from a class of admissible dis- 
tributions. It will now be shown that if all possible distributions are admitted, 
the product-limit is the one for which the sample likelihood is a maximum. 

Let T:<7:< ---, <T; denote the distinct ages of death, with 6; deaths 
observed at 7';. Let 4; denote the number of losses in the interval [7';, 7541), 
including any losses at 7; but none at Tj4:. (Here j7=0, 1, ---, k; To=0; 
Ti4:= ©.) The ages of the A; losses are denoted by L;, i=1, 2, - - - , Aj. Only 
in this section, P(t) will represent not the “true” survivorship but rather an 
arbitrary one that is to be determined so as to maximize the likelihood. The 
likelihood may now be written 


ro 


II Pua) 


L. i= 


[Pct = 0) = Pero] TT Pu 


t=] 








.(P(T, — 0) — P(Ts) ae) 
L. tml 
Here P(T;—0) includes, whereas P(7T;) excludes, the probability of death at 
age 7'; exactly. 

Clearly (5a) will be maximized by making the P(L,;“) and the P(7T;—0) as 
large, and the P(T;) as small, as is consistent with the monotonicity of the 
function and the assumption 7';< L; <7;,;. This means that P(L,;) =1 and 


P(T;) = P(Li®) = P(Ti41 — 9) (5b) 
for all ¢ and j. If the common value is denoted by P;, (5a) reduces to 
k 
1] (Pia — Pi) YP with Py =1. (5c) 
j=l 


Now writing pj =1—9;=P;/Pj-1 gives P}=pip. « - - , p; and Pj3.1—P;=n - - - 
p;-19;. With these substitutions (5c) becomes 


k 
II ps-*q5*, (5d) 
‘=i 
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i—j— i=j—1 aek 


1 
n=N- Yx- YL &=LA+4), or n(T; — 0) 
i=0 i=l i=j 

in the notation of 1.4. In this form each factor with a fixed index j is maximized 
individually by the binomial estimate p;= 5;=(n;—4;)/n;, in agreement with 
the PL in (2a). If \,>0, so that the greatest observed lifetime ¢* corresponds to 
a loss, then P(t*)>0 and the likelihood of the sample is independent of the 
values of P(t) for t>¢*. Thus P(t) and the maximum likelihood estimate are 
indeterminate under the same circumstances, and the identity between them is 
complete. 


6. MEANS AND VARIANCES 


In this section the mean of P(t) and the variances of P(t) and of ‘the mean- 
life estimates 4 and y* are derived. The observation limits are regarded as fixed 
from sample to sample. In principle, unconditional results for random loss 
times could be obtained by integration. 

6.1 The PL estimate P(t). Let L1)<Ia< - ++ < [y-1 be the distin't observa- 
tion limits that are less than the age t= Z, at which P(¢) is being estimated. Let 
nj41=n(L;+0) be the number of survivors! observed beyond L;; Nj41= Enj4: 
=N°(L;+0)P(L;); pp=1—q;=P(L,)/P(L;); and 6; the number of deaths 
observed in the interval (L,1, L;], excluding nonzero values uf the left end- 
point, with L)>=0. Then 


Pi) = [1 db; with p; = (n; — 4))/n,. (6a) 


j=l 


Let E; denote a conditional expectation for m,---, n; (and hence also 
5, > ++, 5;.1) fixed. Then £;6;=p;, provided that n;>0. If the condition 2;>0 
could be ignored, one cou!d write 


EP(t) = E(pi ~~ - pr+Expe] = Elf - + psp] 
= prE[p: ~~» proBasprs] (6b) 
= +++ = pm‘: :p = P(t), 


so that P(t) would be an unbiased estimate. The flaw in this demonstration is 
the indeterminacy resulting when (for some j) nj41:=0 but P(L;)>0, an event 
whose probability is bounded by (2d), and which could be obviated in principle 
by obtaining one or more supplementary observations. In the derivations of 
approximate formulas that follow, any bias that P(t) may have is neglected. 

Both Greenwood [15] and Irwin [19] when giving formulas for the variance 
of an actuarial estimate of P(t) actually treat a special case in which the ac- 
tuarial estimator and P(t) coincide. Translated into our notation their com- 
mon argument is the following. 

Suppose we consider conditional variances for n; held fixed at the values Nj. 





1 The expression L;+0 means that losses at L; itself have been subtracted off. 
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Then the J; may be treated as independent quantities and 


E[P(t)] = 11(os + Pe) = pis ++ Det I1(1 id 7) 
j=l e = (6c) 
= P*(t) T1(1 + i} 


- ml 


If we ignore terms of order N;~? we have as an approximation for the variance 


V[P)] = P*@) > ' (6d) 
j=l 7, iP i 

This last expression we will refer to as “Greenwood’s formula.” The calculation 
of the variance with m, - - - , mx fixed can apparently be justified only on the 
ground that it doesn’t matter a great deal; the fixed n; are unrealistic, and in- 
consistent with the previous assumption of a sample of fixed size and fixed 
limits of observation. 

The authors have therefore rederived (6d) by means of successive conditional 
expectations. The procedure is to verify by induction the approximate relation 


i=k 
Ej4:[P*(t)] = pr +: a(t +2 —) pi 4S gue (6e) 
i=j+l a 


which reduces to (6d) when j =0. Applying E; to (6e) has the effect of replacing 
j by j—1 provided that one sets 


By Po B,| jy 


E i410; EE j4.; 


= p;?/Eyni. 
wherein /;/H;,:n; has been replaced by the ratio of the expectations of numer- 
ator and denominator. The fact that 4; and E;,.; are positively correlated im- 
proves the approximation. 
In the special case in which k=2 (all observation limits less than ¢ are equal 


to Z,) and any indeterminacy is resolved by one supplementary obserwstion, 
the exact variance of P(t) can be calculated to be 


2 N'\? N’'m N’, 1 
v{eo) = Po (A+ [i+ (1- TB 


36-Mata-»)) 
N\ ON/NINA nm) OO 


where N is the total sample size; N’ = N°(t), the number of items observable 
to t or beyond; and 


(6f) 








[N’, mi] = 1/E(X-) (6h) 


with X =max (Y, 1) and Y distributed binomially (N’, p:). The methods used 
by Stephan [27] show that [N’, p:1]=N’p:+0(1) for N’ large. Thus the term 
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in square brackets is of order 1+0(1/N’), and (6g) approaches Greenwood’s 
formula (6d) for large N’. 

6.2 Covariances and mean lifetimes. The method of Irwin [19] can be used 
to derive the variance of @ = {F P(t)dt by writing 


E(4*) = Eff PaPoydedu 
6i 
on 2 ff BLP) Pc) |dedu ™ 


If u<v and L, and L, denote u and »v respectively, then apart from bias due to 
indeterminate cases one has 


E[P(u)P(v)] = E[P*(u)Basa(paribase - ~~ de) | 
= PauiPrs * - Pee [P*(u) | (6j) 
= [P(v)/P(u) ]E[P%(u)] = P(u)P()[1 + U(u)], 
where by (2h) and (2g) 
_ fe |adPO| ." 
U(u) -f woos =D lw-9w-r+ybs 


the summation is over deaths at ages not exceeding u, and gives the sample 
estimate. Thus one has appreximately 


V(a) = 2 ff PePouwand 
i % . (6k) 
= 2f A(u)P(u)U(u)du -f A*(u)dU(u), 


where A(u) =.” P(v)dv. The resu't is discussed in 2.3. 

To obtain analogous (but in this case exact) results for the RS estimates 
P*(t) and u*, let u<v as before, and split off two independent subsamples con- 
sisting of the N’= N°(v) items observable to v or beyond, and the N’’ = Nu) 
—N°(v) additional items observable to u or beyond, but not to v. Let the first 
subsample yield 6’ deaths in (0, ») and e’ in (u, v), while the second yields 5’’ 
deaths in (0, u). Also let P:=P(u) and P;=P(v). Then 

E[P*(u)P*(v) | (with u < v) 

ot E[(N’ + NY” —j — 8’")(N’ — . e’) |/N'(N’ + N”) 

- E[(N’ + OF ee 8’")(N’ hae: 5’) |P2/P,N'(N’ + N”) 

= [E(N’ — 6’)? + E(N” — 8")-E(N’ — 8’) |P:/PiN’(N’ + N”) 

= [N”P,? + N’P,(1 — Py) + N’N"P;*)P2/PiN'(N’ + N”) 

= P(u)P(v) + P(v)[i — P(u)]/N%(u). 
Substituting this result in the analogue of (6i) and subtracting u? gives V(u*) as 
in (3d). 


(61) 
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7. CONSISTENCY; TESTING WITH REPLACEMENT 


The RS estimate P*(t), being binomial, is of course consistent. In view of the 
approximate nature of the formula for the variance of P(t), however, a proof of 
its consistency is in order. In fact, by giving closer attention to the errors of 
the approximations in the variance derivation, it can be shown that P(t) does 
have the limit P(t) in probability provided only that N°(t)— «©. However, the 
proof is too lengthy to be given here. Of course, the consistency is obvious 
enough in the special case in which the number k of conditional probabilities to 
be estimated remains bounded as N°(t)-> =. 

Thus far the independence of deaths and losses has been assumed. The 
question of consistency may also be raised in the differing context of “testing 
with replacement.” (Cf. [7], [10], [11], [14].) This is the common life-testing 
situation in which a fixed number of items, say v of them, are always under 
test; when one dies it is replaced by another. The only preassigned constants 
are the duration of the test, S, and the number, », of items on test at all times. 
The number of losses is then fixed at v, one for each item on test at the conclu- 
sion of the experiment, while the number of deaths observed, and hence the 
total number of items, is a random variable. The first » items on test have S 
as their limit of observation; if one of these dies at age t, with t<S, its successor 
has S—t as its limit of observation, and so on. Consistency will mean an ap- 
proach to P(t) as yS—+«, which implies that either y+ «0 or S—- (or both). 

The RS estimate will be considered first. Let 7:1, Tz, - - - , be an infinite se- 
quence of independent lifetimes with survivorship P(f) and let P,(¢) be the 
proportion of 7;,---, 7% that exceed ¢. By the strong law of large numbers, 
lim P,(t) = P(t) with probability one as k—«, for any specified t. Now the RS 
estimate P*(t) (which exists only for values of ¢ in the interval 0, S) can be 
represented as a value of P,(t), with k a random variable. This can be done by 


assigning the lifetimes 7, - - - , 7, to the » items initially tested. If some of 
these items have replacements whose limits of observation S—T, (¢=1, - - - , v) 
are not less than ¢, then succeeding lifetimes 7,4:, T,42, - - - , are assigned to 


these replacements. The process is repeated until no more observation intervals 
of at least the duration ¢ are left. To prove P*(t) consistent, it only remains to 
prove that for any M, Pr (k>M)-—1 as WS. If vp the result is evident. 
If S—+~, then even for »=1, 


Pr(k > M) =Pr(%i+:---+Tu <8 —?t) 
> Pr [T; < (S —#)/M fori =1,---, Mj (7a) 
= {1 — P[(S — #)/M]}™. 
As S—~ the last expression has the limit unity and consistency is proved, pro- 
vided that P(«)=0 (eventual death is certain). 


When S-+~, » is fixed, and P(«)=0, the consistency of the PL estimate 
P(t) follows from the relation (cf. (4a)) 


we hy: < Bli) < n(t) + » 4 n(t) + ¥ (7b) 
Nt) + D’ N%t) + D’ + ¥ N 
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since the number of losses is », P*(t) =n(t)/N°(#) is consistent, N°(t)— ©, and 
the number of deaths D’ having observation limits less than ¢ is a random 
variable that does not depend significantly on S. P(t) is undoubtedly consistent 
also when S is fixed and »-+~, but a rigorous proof of this has not been con- 
structed. 
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INADMISSIBLE SAMPLES AND CONFIDENCE LIMITS* 


Howarp L. JonEs 
Illinois Bell Telephone Company 


A simple procedure is proposed for dealing with situations where a 
possibility exists that a sample to be randomly selected may turn out 
to be unsatisfactory in some respect. The procedure consists essentially 
in computing confidence limits as if the specified risk were aa(1—pwy), 
where au is the risk we are actually willing to run, and py is the a priori 
probability that a selected sample will not be considered to be satis- 
factory. 


INTRODUCTION 


ONFIDENCE limits computed from sample observations in the usual man- 
C ner are perfectly valid, of course, if the underlying assumptions are cor- 
rect. One of these assumptions is that, when the observations are taken, there 
is no possibility that any or all of them will be discarded as unsatisfactory. On 
rare occasions, however, probability samples are selected that are so unusual in 
some respect that a question arises as te whether the sample should be retained 
or whether a new sample should be selected. 

In a previous paper [3], an actual situation was mentioned (p. 64) where 10 
systematic subsamples were selected with 10 random starts, the procedure 
amounting to the selection of 10 arrays without replacement from a total of 
30 arrays. In this particular situation, it happened that 6 consecutive arrays 
corresponding to 6 consecutive serial numbers were selected. It was known, 
however, that there was a high correlation between adjacent arrays. In the 
paper referred to, the situation was summarized as follows: 

“The statistician was then faced with a dilemma. If he used the sample, he could 
compute confidence limits and state that the population mean lay within those limits 
with the assurance that, if repeated samples were selected and analyzed the same way 
and a similar statement were made about the population mean on the basis of each 
analysis, the statements on the whole would be correct a high proportion of the time. 
But at the same time, it would be apparent to almost anyone who looked at the serial 
numbers selected that the particular sample in this instance was quite likely to be one 
of the exceptional cases where the statement would not be true. On the other hand, 
if the sample were discarded and a new sample were selected, the computation of a 
confidence interval would involve the difficulty that we do not know’the risks in- 
volved in a procedure whereby we keep only samples that we like.” 


To avoid situations of this kind, it was suggested that a different kind of 
sampling procedure be employed. In particular it was suggested that the popu- 
lation to be sampled be divided into slices of equal size, and a random sample 
be selected independently from each slice. If such stratification were employed 
it would be extremely unlikely that 6 consecutive serial numbers would be 
drawn in every slice. It would still be theoretically possible, however. Thus, 
while it is desirable to employ sampling procedures such that samples we do 





* Revision of paper presented at a luncheon meeting of the Chicago Chapter, American Statistical A iation, 
November 15, 1956. 
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not like are highly improbable, even those procedures may not completely 
preclude the possibility of selecting a sample that is suspect. The suggested 
stratification is not a completely satisfactory solution to the problem. 

A situation much like the one described may arise in experimental design. 
In an invited paper [7] before the Institute of Mathematical Statistics, 
Youden recently discussed the undesirability of using certain types of ran- 
domly chosen block arrangements of the experimental data. His suggestion is 
that such types be treated as unsatisfactory and excluded from the population 
of arrangements from which a random selection is to be made; but that in- 
ferences be drawn from a satisfactory arrangement, randomly selected, as if the 
selection were unrestricted. Other papers [2, 6] relating to restricted designs 
have been called to my attention by Kruskal. 

The purpose of this paper is to suggest a simple alternative procedure which 
involves no assumption about the characteristics of the samples we exclude, 
and which may help to resolve the difficulty in certain favorable situations. As 
Kruskal has indicated (in private correspondence), the underlying principle 
can be extended to tests of hypotheses, where it can be applied to Type I or 
Type II errors, or both. 

Thomas Hayton of the American Telephone and Telegraph Company has 
pointed out (verbally) that the problem of inadmissible samples is related to a 
problem in constructing tables of random numbers. These tables are com- 
monly subjected to tests for randomness, and rejected or corrected if they fail 
to pass these tests. A question arises as to the possible effect of this censorship in 
computing confidence limits for sample statistics when a censored table is used 
to select the sample. The procedure proposed here can be extended to cover this 
situation, provided (a) the process used to generate the numbers is actually 
random, (b) an entire table of numbers would have been either accepted or re- 
jected, but never patched up, and (c) the user of the table knows the joint 
probability that a table produced by the random process would not be used, 
and that a sample selected by employing the table would not be accepted as 
satisfactory. Condition (a), of course, is never completely realized. 

Another related problem is that associated with wild values, or outliers, in 
experimental measurements where randomness is assumed, rather than de- 
liberately introduced. Suppose, for example, that some measurements are as- 
sumed to be observations of a process in “statistical control”; that is, the meas- 
urements are assumed to behave as though they were random selections from 
a normally distributed universe. We might then want to test the most extreme 
observation for outliers, as by using the ratio of the most extreme 
deviation from the mean of the observations to the standard deviation of these 
observations. The statistic to be used, of course, will depend on the feared 
alternatives. The procedure proposed here could be applied in conjunction with 
a test of this kind, if we agree to discard all observations if the most extreme 
observation fails to pass the test, and if we know the probability of failure 
when the measuring process is actually in statistical control. 


UNDERLYING THEORY 


Let S denote the class of «!! possible samples that can result from some par- 
ticular sampling procedure. Let this class be divided into two mutually exclu- 
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sive subclasses, A and N, such that every possible sample in the class S belongs 
to either A or N, but no sample belongs to both. For our purpose here, A de- 
notes the subclass of all admissible samples, and N denotes the subclass of all 
samples that are not admissible, according to some specified criterion. This 
criterion for admissibility may be anything we choose, as long as it clearly 
divides the entire class S into the two sub-classes A and N. 

Consider a fixed rule of inference, applicable to each possible sample, and 
let a4 and ay denote the proportions of samples in subclasses A and N, re- 
spectively, for which this rule will lead to an incorrect inference. Then the 
probability for the entire class S that a randomly selected sample will lead to 
an incorrect inference is 


a = pyan + (1 — pw)aea, (1) 


where py is the proportion of the samples in class S that belong to subclass N. 
The problem considered here is essentially this: If we know a, what can we 
say about a4? 
Two possible answers to this question suggest themselves: 


Qa s Qa, if QA s an. (2) 


a 





as , for every py and ay. (3) 


"Fie, 
These inequalities follow from the fact that py and ay in equation (1) are 
necessarily nonnegative numbers. 


YOUDEN’S PROCEDURE 


Youden’s suggestion appears to be an application of inequality (2). From 
experience with various types of experiments, he is convinced that for certain 
randomized arrangements in block designs the conditional risk of drawing an 
incorrect inference from experimental data is greater than for other arrange- 
ments. Hence, in making the random selection he excludes or discards the ar- 
rangements associated with the greater risk. 

Suppose we designate an arrangement that is excluded an inadmissible ar- 
rangement. Then in our notation, py is the proportion of the total number of 
possible arrangements that eve inadmissible, ay is the proportion of inad- 
missible arrangements that will lead to an incorrect inference, a, is the propor- 
tion of admissible arrangements that will lead to an incorrect inference, and a 
is the proportion of the entire class of both admissible and inadmissible ar- 
rangements that will lead to an incorrect inference. In most situations, a is the 
probability that appears in the published tables. In effect, what Youden says 
is the following: For a subclass N of possible arrangements that I shall desig- 
nate as inadmissible, and a subclass A that excludes subclass N but includes 
all other possible arrangements, I am sure that ay >«a,. I shall therefore restrict 
my random selection to subclass A. In drawing inferences, I shall rely on in- 
equality (2), and take a4 as approximately equal to the a@ in the published 
tables. 
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In the situation described in the second paragraph of this paper, the question 
of whether or not to keep the sample with 6 consecutive arrays among the 10 
arrays selected actually arose because it was believed that the risk of drawing 
an incorrect inference from samples belonging to the subclass that includes 6 
or more consecutive arrays was greater than the probability in the published 
tables. In other words, it was believed that ay>a, where ay is the risk of 
drawing an incorrect inference from samples of size 10 with 6 or more con- 
secutive serial numbers, and a is the probability in the published tables. The 
sample was therefore discarded, and a new sample was selected, largely be- 
cause of this belief. 

When the subclass of inadmissible samples is well defined in advance of the 
actual selection, it will make no difference whether inadmissible samples are 
excluded before the actual selection, or whether they are discarded after the 
fact. In either case, the exclusion can be defended when the condition following 
inequality (2) is actually true. 

With respect to the question as to whether this condition is actually true, 
however, a statistician as such will often have no opinion. He may have to rely 
on the experience and judgment of someone familiar with the field in which the 
sample study or experiment is to be conducted. On the other hand, the statis- 
tician is frequently consulted in designing and interpreting samples or experi- 
ments for the very reason that his conclusions are presumed to be objective, 
and not influenced by a prior opinion, right or wrong, about the subject matter. 
The statistician is therefore not likely to be completely satisfied with inequality 
(2) in assessing the risk of an incorrect inference. 


THE PROPOSED PROCEDURE 


If a4 is known or can be corveniently estimated from the sample observa- 
tions (as is the case for certain restricted designs) the statistician’s course is 
clear. But there are cases in which a (or its maximum) can be estimated for 
the unrestricted design, while a, is difficult to estimate. For these cases we sug- 
gest a procedure based on the general inequality (3). 

Care must be exercised in applying this procedure. Thus, in the situation 
that was described in the second paragraph of this paper, the final decision to 
discard the sample with 6 consecutive arrays was made after computation 
showed there was less than 1 chance in 100 of getting 6 or more consecutive 
serial numbers in a sample of size 10 that is randomly selected without replace- 
ment from the serial numbers 1 to 30, inclusive. Discarding the sample and 
selecting another could therefore be justified (rationalized may be a better 
word) on the ground that rejecting samples of a type that would rarely be se- 
lected will not have much effect on the proportion of incorrect inferences in re- 
peated samples. While the actual decision to discard was probably correct, the 
reason behind it is not completely satisfactory because the subclass of inad- 
missible samples was not well defined in advance. There may be some question 
as to just what samples would have been regarded as inadmissible, and there- 
fore as to whether the probability of selecting an inadmissible sample was 
actually negligible. 
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The procedure we propose, under ideal conditions, would be as follows: 

1. In designing a random sampling plan or experiment for the purpose of 
calculating a confidence interval based on the probability distribution of some 
sample statistic, make any necessary change in the usual procedure so that the 
probability of selecting an inadmissible sample or design arrangement will be 
small. 

2. As a part of the sample design, proceed in the usual way to decide upon 
the maximum permissible risk of drawing an incorrect inference from the 
sample data or experimental results. Designate this risk as the value of a4. 

3. Before the actual selection of the sample items or of the design arrange- 
ment is made, decide precisely which samples or arrangements are to be re- 
garded as inadmissible. The definition should be so chosen that any sample or 
arrangement will be classified as inadmissible if, in the judgment of the statis- 
tician or the research worker for whom the design is made, the risk of drawing 
an incorrect inference from such sample or arrangement, if selected, will ex- 
ceed QA. 

4. Compute py, the probability of selecting an inadmissible sample or de- 
sign arrangement. When use is made of a published table of random numbers, 
the value of (1— py) should be the a priori probability that neither of two events 
will occur: (1) the random process used to generate the numbers will produce a 
table that would not be published, and (2) the sample or design arrangement 
resulting from use of a published table will not be considered to be admissible. 
Subtracting the sum of the two probabilities for these two events from one 
(thus treating the events as mutually exclusive) will result in a conservative 
value (lower bound) for (1—py). 

It should be understood that questions as to whether a random number table 
has actually been rejected before one has been found suitable for publication, 
and as to whether a sample has previously been found inadmissible, are not 
pertinent to the evaluation of py. The pertinent questions have to do with the 
a priori probabilities that a table will be rejected and that a sample will be 
classified as inadmissible. 

5. Select the sample or design arrangement by the random method incorpo- 
rated in the design, and keep and use it unless it is one of those previously de- 
fined as inadmissible. If it is inadmissible, repeat the selection until an ad- 
missible sample or design arrangement has been selected. Keep and use the 
first such admissible sample arrangement. 

6. In calculating the confidence interval on the basis of the probability dis- 
tribution of the sample statistic that was chosen for this purpose, proceed as 
if the maximum permissible risk of drawing an incorrect inference were 


aa(l m. PN), (4) 
rather than a4. 


ILLUSTRATIVE EXAMPLE 


For illustrative purposes, consider the following situation. We wish to select 
a sample of 40 from a population of size 1000 that is serially numbered from 1 
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to 1000. The measurements of the property in which we are interested are 
known to have a symmetrical, unimodal distribution resembling the normal 
distribution; but the mean and variance of the distribution are unknown. Our 
purpose in selecting the sample is to compute 90 per cent confidence limits for 
the population mean. The population is known to be fairly homogeneous, and 
the most feasible and economical sampling procedure appears to be simple 
random sampling without replacement. The ¢-distribution seems suitable for 
computing the desired confidence limits. A table of random numbers is to be 
used in selecting the sample. The probability that the process used in generating 
these numbers would result in a table not regarded as satisfactory is deemed to 
be negligible. Adjacent individuals in the population are somewhat correlated 
in a positive direction, however, so that the inclusion of two or mere consecu- 
tively numbered individuals in the sample seems undesirable. Aside from this, 
we wish to place no restrictions on the sample selection. 

For this situation, the steps in the procedure we propose might turn out to 
be as follows: 

1. We decide that unless the value of py is too large, as subsequently de- 
termined, we shall follow the sampling procedure that appears to be most 
feasible and economicai: random sampling without replacement. 

2. Since we wish to compute 90 per cent confidence limits, we set 


os ™= 1 — .90 = .10. (5) 


3. Since the selection of two or more consecutively numbered individuals 
seems undesirable, we define samples that include such individuals to be inad- 
missible, and all other samples to be admissible. 

4. We compute py, the probability of selecting an inadmissible sample. This 
can theoretically be done when the definition of an inadmissible sample or ar- 
rangement depends entirely on certain combinations of sample serial numbers 
or letter sequences. For example, if we select a sample of size m by randomly 
selecting m numbers without replacement from the serial numbers 1, 2, - - - , M, 
and if we define an inadmissible sample as one in which the numerical difference 
between any two sample serial numbers is less than some integer d, then the 
value of (1—py) is 


fe )elnalea te ie 





(6) 


m™ m 


If we define an inadmissible sample to be one in which there is at least one run 
of consecutive sample serial numbers larger than r, the value of (1—py) is 


: f{M —-m+1\(M — (r+ 1) M 
STU o gin. leectubananld cab oO 
i= 1 m — i(r + 1) m 

where c is the integer that satisfies the inequality 


< 


vende ; 8 
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The derivation of these formulas is outlined in another paper [4]. In this ex- 
ample, formula (6) leads to 


1 — pw = .20, approximately. (9) 


5. We next select 40 sample serial numbers without replacement from the 
numbers 1 to 1000, repeating the selection if necessary until we obtain a set of 
40 numbers of which no two are consecutive. If the first 5 random numbers we 
select are 265, 482, 747, 395, and 481, for example, we discard all five numbers 
because 481 and 482 are consecutive, and start the selection over again, re- 
peating the procedure as often as necessary to obtain a properly selected set of 
40 nonconsecutive serial numbers. These numbers will designate the individuals 
to be included in the sample. 

6. We compute the confidence limits 


X — tas/+/40, (10) 
X¥ + tas/+/40, (11) 
where, for the sample measurements Xi, Xe, -- +>, X«o, 

1 46 

X = - > Xi, (12) 
1 40 

= = u (X; — X)?, (13) 

a = aa(l — py) = (.10)(.20) = .02, (14) 


and ¢, is the value of t, for 39 degrees of freedom, such that the probability that 
t will not exceed this value is 1—} of .02, or .99. 

By interpolating in a table of the ¢-distribution (e.g., [5], p. 134) for 39 de- 
grees of freedom, we find the value of ¢ such that .99 is the probability that ¢ 
will not exceed this value to be 


t= 2.429, approximately. (15) 


Hence, if X¥ =10 and s*=9, for example, the required confidence limits will be 
approximately equal to 


10 — 2.429+/9/40 = 8.85 (16) 
10 + 2.429,/9/40 = 11.15. (17) 
ADVANTAGES AND DISADVANTAGES OF THE PROPOSED PROCEDURE 


In designing sample surveys, it sometimes happens that the people for whom 
the survey is designed will raise the objection that the sample is “not repre- 
sentative.” When this remark is made, I like to quote a remark by W. E. 
Deming [1] to the effect that “what we select are not representative samples, 
but probability samples.” With this introduction, an explanation of what we 
mean by probability sample and confidence limits is usually sufficient to satisfy 
the skeptics, or at least, to silence the objections. 

On the other hand, we try to use some common sense. And common sense 
tells us that while we can safely accept odds of better than 1 to 99 that the 
mean of a population will lie outside the computed 99 per cent confidence limits, 
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for example, when the wager is made before the sample is selected, we should be 
very foolish to accept the same odds if offered to us by an expert in the field 
after he has been informed as to which individuals appear in the sample. 

The principal advantage of the procedure based on inequality (3) is that it 
enables us, to some extent, to use a little common sense in reconciling this dif- 
ference between a priori and a posteriori probabilities. It takes into account 
the fact that information as to the particular subclass to which the sample 
belongs may be relevant in judging whether a rule of inference is likely to lead 
to an incorrect decision. It provides a mechanism for dealing with those specific 
situations where the serial numbers selected provide such information. 

What we should do, it appears, is discuss the situation with our client, and 
decide before the sample is selected just what combinations of sample individuals 
will be regarded as “unsatisfactory.” Such samples will be classified as inad- 
missible. In most cases, perhaps, the classification will have to be made on the 
basis of a maximum number of consecutive serial numbers, or on a minimum 
interval between serial numbers. While a sample of individuals from just one 
or two of several groups of geographical locations might also be classified as 
inadmissible, it would seem better to remove this difficulty by stratifying the 
sample if this is feasible. The discussion with the client should lead to an agree- 
ment that any sample not classified as inadmissible will, if selected, be retained 
as satisfactory. 

The second advantage of the procedure based on inequality (3) is that it 
avoids any question as to the possible effect of bias in the estimates of the 
mean and variance, due to excluding inadmissible samples, on the computed 
confidence limits. If the quantity a is the true probability that the mean of the 
population will lie outside the confidence limits computed by some fixed rule, 
if applied to repeated samples of a given class from a given population, then 
the quantity a/(1—py) is an upper bound to the probability that the mean of 
the population will lie outside confidence limits computed by applying the same 
rule to an admissible subclass of samples. The statistician thus has an objective 
method of computing confidence limits from a sample that is classified as ad- 
missible. 

A disadvantage of the proposed procedure is that computing the probability 
of selecting an inadmissible sample may be difficult or impossible. To illustrate, 
in: sampling from a discrete universe, it might be desirable to exclude samples 
where all sample measurements happen to be alike, if it were known that the 
measurements in the universe were not all alike. But the probability of selecting 
a sample of that kind may be completely unknown. Even where the subclass of 
inadmissible samples is based entirely on combinations of serial numbers, the 
computation of the probability may be long and difficult. 

Another disadvantage of the proposed procedure is that a confidence interval 
computed on the basis of inequality (3) will be wider than ine computed by 
the methods now in use; or, the samples will have to be larger to obtain a con- 
fidence interval of a specified width. The reason is that the proposed procedure 
takes account of the possibility that the samples classified as inadmissible, 
presumably because the rule of inference is more likely to lead to an incorrect 
conclusion than for other samples, may actually be samples where the rule 
always leads to the correct conclusion. Thus, the proposed procedure takes the 
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most pessimistic view that is possible with respect to the judgment of the 
statistician and his client in choosing those sample combinations likely to lead 
to trouble. It is something like charging a higher insurance premium (than 
would be appropriate for the whole population) to those who pass a medical 
examination because some individuals are rejected on the basis of the exami- 
nation. 

The increased sample size, or the wider confidence interval, is the price that 
seems necessary to pay, however, if the statistician is to compute a confidence 
interval in which he can actually have confidence on the sole basis of his spe- 
cialized knowledge. This price may be exorbitant in cases where there is a fairly 
large proportion of possible samples with respect to which the client would be 
more or less dubious about the probabilities computed by applying a rule of 
inference. 

On the other hand, if an expert in the field is willing to employ inequality (2) 
and state that, in his opinion, the mean of a sampled population lies within 
narrower limits than those computed by applying inequality (3), there would 
seem to be no objection to including his statement in the statistician’s report. 
The client could then take both sets of confidence limits into account in making 
his decision. 

Perhaps the most serious drawback to general use of the proposed procedure 
is the fact that it requires the subclass of inadmissible samples to be precisely 
defined in advance. It is likely to be difficult or impossible to secure a good 
definition of this kind. Ideally, we should know enough about the universe we 
are sampling, about the estimation of limits of sampling variability, and about 
balancing precision against cost and administrative feasibility to select a sample 
design which makes “unsatisfactory” samples highly improbable. In real life, 
however, we must be prepared to find, after the sample has been selected, that 
we have overlooked something so obvious that we cannot defend our design 
without further restriction. 

Nevertheless, keeping the ideal procedure before us as a goal may help to 
minimize the number of unsatisfactory samples. The primary purpose of this 
paper is to help attain that objective. 
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When each unit in a population of N units consists of an z and a y 
measurement, where the population mean ¥ of the z’s is known, then 
the population mean Ff of the y’s is frequently estimated by drawing a 
random sample of n units and using one of the customary biased ratio- 
type estimators, 7 (based on the ratic of the y sample mean and the 
x sample mean), or (based on the sample mean of the ratios of the y 
and x measurements for each unit). The precision of these estimators is 
compared with an exact formula derived here (when N>>n) for the 
variance of an unbiased ratio-type estimator y’, a modification of 7 
and g, introduced in [5]. Conditions pertaining to the population 
moments are obtained under which Var y’ < Var @, and it is also shown 
that Var y’<Var 7 (when n— ~) if and only if the slope of the popula- 
tion regression line of y on z is closer to the population mean of the 
ratios of the y and z measurements for each unit than to 7/X. The 
estimation of the variances of the three ratio-type estimators is dis- 
cussed, and some numerical examples are presented to illustrate the 
comparison of the estimators. Certain special populations are examined 
in detail, and the relation between regression and ratio-type estimators 
is also discussed. 


1. INTRODUCTION 


HIS paper is concerned with a modification of the usual ratio estimators 
‘an leads to an unbiased estimator. This property of unbiasedness is im- 
portant in certain sample surveys involving a large number of strata. The pre- 
cision of the unbiased ratio estimator is compared with the usual ratio estima- 
tors. Both the situation where the sample size in each stratum is small and the 
situation where the sample size is large will be considered in the present paper. 
In most of this paper, no assumptions are made about the population distribu- 
tions (e.g., no assumptions concerning normality appear here), and the results 
obtained do not depend on such assumptions. 
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D. 8S. Robson, in a recent paper [9], refers to the unbiased ratio-type esti- 
mator and, the formula for its variance presented in the present paper (actually, 
he refers to an earlier unpublished version of the present paper), and he com- 
putes the exact variance formula using multivariate polykays. If the size N of 
the population is large relative to the sample size n, the variance formula pre- 
sented in the present paper (which is “exact” when N>>n) is recommended, 
since it is simpler to compute than the exact formula in [9]. However, the 
latter formula is exact in the situation where N may not be large relative to n, 
and should be used in that case if exact formulas are important. 

The existence of bias in the standard ratio estimators obtained from random 
samples has led to the exploration of the possibilities of modifying the sampling 
scheme so that the ratio estimator would become unbiased for the modified 
sampling scheme (see [8] and the articles referred to therein). In the present 
paper, the sampling is not modified (i.e., the case of a simple random sample 
drawn from a finite population is considered here), but the usual ratio estima- 
tors are modified so that a ratio-type estimator is obtained that is unbiased 
for the usual simple random sampling scheme. 


2. THE BIAS IN RATIO ESTIMATORS 


Ratio estimators are frequently employed in sample surveys when estimating 
the population mean FY of a variable y with the help of the known population 
mean X of a variable x that is positively correlated with y. In the case of a 
simple random sample of n pairs (z;, y;) drawn from a population of N pairs, 
various ratio estimators of Y can be formed. The following two estimators, 
which are the usual ratio estimators, will be considered here: 


j= X9/# and 9 = X7, (1) 


where 9, = and 7 are the arithmetic means of the sample of y;, 2; and r;= y;/z; 
respectively (see p. 111 and p. 124 in [1]). In this paper we shall assume that 
z,>0, and that X¥ is known. (In some surveys, when some z;=0, the use of 
larger sampling units, so that the new x; being averages of the original z; are 
positive, can be recommended for certain purposes in some of these situations.) 

One well-known defect of these estimators is the fact that they are usually 
biased by amounts for which only approximate formulas are offered in the 
literature. Often in practical surveys, the bias is a small fraction of the stand- 
ard deviation of the estimate and can be neglected. There is, however, an im- 
portant class of survey designs in which this bias may become considerable. 
If a small sample of (z;, y;) is drawn from each of a large number (k) of strata, 
then it will often be found that the bias in the estimate of each stratum total is 
of the same sign, and the bias in the estimate of the population total will be 
approximately k times that for an individual stratum total while the standard 
deviation only multiplies by /k (see e.g., p. 130 in [1]). Therefore, the mean 
squared error of the estimate of the population total will, in this case, be of the 
order of magnitude k?; while, if the estimate of each stratum total is unbiased, 
the order of magnitude will be k. It is in these situations that the use of un- 
biased ratio estimators is of great advantage. 
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3. FORMUL.S FOR BIAS AND ITS ELIMINATION 


In a note [5], simple formulas for the bias in 9 and # were given, which led 
to the construction of an unbiased ratio estimator. These results may be re- 
stated as follows. By definition of the population covariance, we have the 
equations 


E(9) = ¥ — Cov (4, c) (2) 


x 


E(j) = Y — Cov (2. 2) (3) 
£ 
where the symbol F(z) denotes the expectation for the population of all possible 
sample values of z computed for samples of size n. (This incorporates as a spe- 
cial case (n=1) the usual expectation over the N values of the finite popula- 
tion.) The equations (2) and (3) yield convenient and exact expressions for the 
bias. An immediate consequence of (2) and (3) are upper bounds for the biases; 
viz., 


| Bias in 9| < oyz-o2, and | Biasin §| < ojz-o%. (4) 


These upper bounds are proved (but only as large sample approximations) in 
some textbooks. 

Let us now consider the elimination of the bias in #. Clearly an unbiased 
estimate of Cov (y/z, x) is given by 


(VN-1) 2 _ iit mee BBC) 1, rs 
Wa-b > (r; — #)(xj — #) = Va-b =D (fj — #2). 


(N - 1)n 


N(n — 1) ile, 


is an unbiased estimator of Y. 


4, THE PRECISION OF THE BIASED AND UNBIASED RATIO ESTIMATORS 


The usual variance formulas for the customary biased ratio estimators 9 and 
§, which are available in the literature, are approximate. An approximation for 
the variance of 7 is usually derived as a low order expansion in powers of 
8z=(%—X)/X or by an equivalent approximation (see e.g. [1], p. 115). The 
approximation for the variance of 9 is usually derived as a low order expansion 
in powers of 5x;= (x;— X)/X or by equivalent approximations. With the former 
the 5 are assumed small in absolute value, while with the latter this assumption 
is made for 6z; (e.g. [1], p. 117). Moreover, it is often assumed that N is large 
compared with n(N>>n). In the present paper, we make throughout the as- 
sumption that z;>0; i.e., that —1<éz;. This is the only assumption required 
for the validity of subsequent exact formulas. However, when discussing the 
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relative order of magnitude of certain terms in certain of our formulas, we base 
this discussion on one of two further assumptions; viz., 


(a) | 6x;| «1 
or 
(b) | dz; | «1 and | dy: | <1, 


where 6y;=(y;— Y)/Y. These latter assumptions, the reader will note, can be 
quite restrictive. Deriving an approximate variance formula for y’ under as- 
sumption (b), we obtain 


Var y’ = n-'Y?{-0,2/V? + o,2/X? — 2 Cov zy/XY} (6) 
provided n<N. This formula was obtained by retaining all terms up to and 
including cubic terms in 6z; and éy,. Formula (6) will be identified as the fa- 


miliar approximate formula for both Var 9 and Var # if cubic and higher order 
terms in the respective formulas are neglected. 


4.1. The comparison of the estimators y’ and 9 

In order to compare the variances of the ratio estimators y’ and 4, we de- 
velop exact variance formulas. It is convenient to derive such formulas in 
terms of the joint population distribu‘ion of the 2; and r;=y,/z; rather than 
that for the z; and y;. For the sake of simplicity, we shall continue to assume a 
large or an infinite population (N>>n), although it is possible to derive the cor- 
responding results for finite populations. 

The estimate 9 = X7 is biased by an amount (RX —Y), where R is the mean 
for the population of the r; values. Hence, the estimate will not be consistent 
unless R= Y/X. The exact variance formula for 9 is 

Var 9 = X°o,2/n, 
We can simplify the formula for our unbiased estimator y’ to 
y = Xf+e, 


where 
sp aa Mie wn Sle ine so ling a> Hh 
t=] 


is an unbiased estimate of the population covariance between x and r. The 
exact variance for y’ is therefore 


Var y/ = n-'X%o,? + 2X Cov fc + Var c. (10) 

The second term in (10), after some straightforward calculations, can be 
written as 

2Xn—'E[(Az)(Ar)?], (11) 

where Ar =(x—X), Ar=(r—R). For the third term in (10), we employ stand- 


ard techniques of using the moments of bivariate sample cumulants (see [2] 
and [3]). From this we obtain 





1 oro, on—2 
Varc = = $B [(az)*(ar)*) of _ cnt} : (12) 
n n-1l n-1 
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where o,?=E[(Ar)?] and o,2=E|[(Az)*] are the population variances of r and 
z respectively, and C,,=EH(ArAz) is the population covariance. Combining 
these terms, we have 


n Var y’ = X*o,? + 2XE[(Ar)(Ar)?] + E[(Azx)*(Ar)?| 


< (n = 2)C,s* 
ee om (13) 
n—1 n—1 
which is the exact variance formula for y’. 

In terms of 6x; and dy;, the first term (which is n Var 9) in (13) is of 2nd 
order, the second (covariance term) is of third order and the 3rd, 4th, and 5th 
terms are of 4th order. Hence, if the coefficients of variation of x and y are 
small (assumption (b)), the 2nd term will be the important one when comparing 
Var y’ and Var 9. This term will be negative if the array variance of r given z, 
diminishes with increasing x (see Section 6 for some data that illustrate this 
case). In this case, the estimate y’ is to be preferred to 9. 

In situations where the coefficients of variations are not smaller than say }, 
we may not be able to neglect entirely the 3rd, 4th and 5th term in (13). Noting 
that 


(14) Var (ArAx) = E[(Ar)*(Az)?] — C,.*, (14) 
and introducing the regression coefficient 
8 = E[(Ar)*Axr]/Var (ArAzx) (15) 


of Sr on the product ArAz, equation (13) may be rewritten in the form 


ain wt 1 
n Vary’ = X°o,? + Var (ArAxr) {2X8 +1} + : (o2%0,? + Cys). (16) 


ni—- 


The third term in this formula is not larger than 2¢,’c,?/(n—1). Hence, this 
third term represents a fraction not larger than 20,2/X?(n—1) of the leading 
first term X?o,?, and will usually be negligible. If this term is neglected, the 
criterion as to whether Var y’ < Var # or vice versa will depend on whether or 
not 6< —1/(2X). Thus, if the array variance of r diminishes with increasing z, 
this will indicate that E[(Az)(Ar)?] <0; ie., 8<0. If 8<O and X is large when 
compared with | Az], then it will be found, in many cases, that 8B< —1/(2X). A 
clearer indication as to whether the inequality Var y’ < Var 9 is satisfied can be 
obtained by computing unbiased variance estimates for both y’ and 9 as indi- 
cated in Section 5. 

In this section, we have indicated when Var y’< Var 4%: If it is, then y’ is to 
be preferred to 9 since y’ is unbiased and also has a smaller variance. However 
when Var y’>Var 9, 9 is not necessarily to be preferred to y’, since the bias of 
49 may be large. In terms of 62; the bias is of the first order, and it does not 
diminish as n increases. Hence, except in the case where R= Y/X, the bias of 
§ must also be considered when comparing this estimator with y’. 


4.2 The comparison of the estimators y’ and 9 


For the comparison of the variances of 7 and y’, we shall introduce moment- 
coefficients for both the joint distribution of y and z as well as z and r. Let us 
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first simplify the exact variance formula for y’. Since 
Var (y — Rx) = Var (ArAr + XAr) 
Var (ArAr) + X%e,? + 2XE[Ax(Ar)*] (17) 
oy? + R%o,? — 2RCyz, 
equations (13) and (16) may be rewritten as 
n Var y’ = (04? + R%o.? — 2RCyz) + (n — 1)-(0,2o22 + Cre”), (18) 


which is a simple form for the exact variance of the unbiased ratio estimator y’. 
Let us now consider the mean square error E(j—Y)? of 9. We have 


x 


> — 52)? 5 — bz) 2 
MS.E. = E(5 — Y)? = °° {- = = YE {. } (19) 
1+ 82 


where 69=(9—Y)/Y and 6#=(%—X)/X. Hence, the mean square error is 
equal to the value, at argument @=1, of the function 


55 — | 2 (20) 


fe) = ret: + 05% 


and may therefore be represented by its Taylor expansion, at @=0, with exact 
3rd derivative remainder term. This expansion yields 


Y-*(M.S.E.) = E(59 — 62)? — 2E{ (89 — 52)%62} 
(69 — oer 
(1 + 65z)5 J 


Since the 4th derivative f (@) is >0 for 0<@<1, f* (@) is monotonically 
increasing and the third order remainder term must lie between the two values 
obtained when putting 6=0 and @=1 respectively; i.e., between the limits 


L, = — 4E{ (89 — 52)*(52)*} 





+ 3E{ (59 — 5#)2(8z)?} — sr} (21) 


and 





I,=—4E ‘ (22) 


(69 — ee 
(1+ 625 f° 


The expansion (21) proceeds in powers of 4 the first term being of order 0 
and the last of 3rd order. For large values of n, both 6% and 49 are of order 
1/\/n. Therefore, the leading term in (21), which is of order n-, yields the 
asymptotic large sample formula for Y~-? (M.S.E.). Since the square of the 
bias (Cov? (#/Z, #)) is clearly of order n~*, the leading term in (21) also yields 
the large sample formula for Y~* Var 7. Thus as n>, 


Var 9 — Y°E(é5 — 5%)? = n—(o,? + Q%o,? — 2QCyz) (23) 
where Q=Y/X. 
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The corresponding large sample formula for Var y’ is from (18) obtained as 
Var y’ > n—(0,? + R2o,? — 2RC,z). (24) 


Hence, asymptotically for large n, we have 


Var 9 — Var y’ = n-'o.*{(Q — B)? — (R — B)?*} (25) 


where B=C,,/c,” is the regression coefficient of y on x. Equation (25) shows 
that when n is large, Var § will be larger than Var y’ if and only if the regression 
coefficient B is nearer to R= E(y/z) than to Q=Y/X. We also find that Var 9 
will be larger than Var y’ if and only if line Y+R(x—X) “fits the scatter dia- 
gram of y on x” better than the line Y+Q(x—X) =Qz. In particular, the vari- 
ances agree if R=Q; i.e., if the ratio estimator 9 is unbiased. When RQ, the 
regression coefficient B will, in our experience, often be closer to Q than to R, 
and in such cases the large sample formula for Var y’ will be larger than that 
for Var 9. 

When the sample is large, the difference between the variances of the two 
estimates can be estimated by 


n-'s,2{(q — b)? — (# — b)?} 


where s,? is the usual unbiased sample estimate of the variance of z, b is the 

usual estimate of the regression coefficient of y on x, and g=74/Z. The variances 

of both 7 and y’ can be estimated in a similar manner using the appropriate 

estimates of the terms in equations (23) and (24). The appropriate estimate of 

the terms in (23) can be obtained using a similar method to that given on p. 

119 in [1]. Using this method, an estimate may also be obtained for (24). 
The difference between the variances of § and y’ can be written 


Var 5 — Vary’ = n-'c°(Q + R — 2B)(Q — R) 


o a (26) 
= n—e°C2(Q +kK=- 2B)/X, 


which sbows that, under assumption (a), this difference is of the order of 
magnitude n-'43. When n is small, it does not seem possible to derive a simple 
criterion for the comparison of Var f and Var y’ in the general case, but some 
special cases are considered in Section 7. 


5. THE ESTIMATiON OF THE VARIANCE OF y’ 


In this section, two types of formulas for variance estimation are developed. 
The first is directly derived from the formula for the variance (10) and arises in 
the orthodox way of replacing each term in this formula by an unbiased esti- 
mate from the sample values. We again assume an infinite population and make 
use of bivariate k-statistics, which are unbiased estimators of the corresponding 
population cumulants (see [2] and [3]). With the help of this theory, the 
following results can be assembled: Defining by 


$2 = (n— 1) D (rs — M82 = (Wn -— 1) D (i — 9), 
c= (n—1)7° DS (a — (ri — 9) 
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the sample variances and covariance, and by ¢,*, ¢.* and C,, the corresponding 
population parameters, we have 





E(s,*) = 
1 1 n 
E(c?) = — Kn + o,20,? + ——— C,,’ (27) 
n n—l —1 
1 2 
E(s,*s8,7) oo = 6 + o,°02" + Sn = Gost ’ (28) 
n — 
where the cumulant Ky is defined as 
Ky = E[(Ax)*(Ar)?] — 20,7 — 2C,2?. (29) 


An unbiased estimate of the cumulant Ky» is the k statistic kx» (see p. 281 in 
[6]), where 

















n 2(n + 1) 2(in + 1 
n+ 1)Se — SS — ————— Si2Sw 
” (n — 1)(n — 2)(n — 3) mat " n — n _— 
(n — 1) 2 2(n — 1) 8 
_ S2oSee — ———— Sy? — — SySpSa (30) 
n n n 
2 2 6 
+ — SpoSou? + — SS? — — S.8at} 
n n 10 
and 
Sy S > a'r? 
t=] 
We now note that 
and introducing 
c= > (r; — #)*(x; — 2), (32) 
r arg 1) toe] 
we find that 
‘ n—2 
E(c’) = ee E[(4r)*Az]. (33) 


Equations (27), (28), (29) and (31) permit the calculation of unbiased estima- 
tors of E[(Az)*(Ar)?], 2c, and C,,?, and hence of Var c. Equations (32) and 
(33) yield an unbiased estimator of E[(Ar)*Az]. Combining these results, we 
have, for samples of size n>3, formulas for unbiased estimators, v(#) and 
v(y’), of the variances V(#) and V(y’) respectively; viz., 


no(9) = X°s,2, (34) 
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and 
v(y’) = n-'X%s,? + 2Xe’/(n — 2) 


2 
(n — 1)s,?8,2 + (n — 3)c? + (1 - -) (n — 1)kee 
+ “ (35) 


n?=—n—2 





where the computational formulas for c, c’ and kz» are as follows: 


n(n — lhe =n Dy — (DL 2td(D rd, 


(36) 
(n—1)e = Dysri-2 Dy tH Ya; — (n — 14s 
and ke» is computed from (30) remembering that in our case 
Sx = b> y.’, Sn = pie Yi, Sy = : ® Yi, Sx = 7 2, Se = a ri? (37) 


Su = > vs, Sw = > 2; Sa = > ri 
The formula for v(y’) is computationally cumbersome, and it is therefore of 
interest to develop a simpler method of variance estimation. This second 
estimate of variance, although again unbiased, will usually be affected by a 
larger sampling error. Moreover, with this method, it is convenient to modify 
the sampling scheme as follows: Suppose that we are sampling from a popula- 
tion of N pairs (z, y), and that n=2m is even and m>2, Draw a first random 
sample of m pairs (x;, y;) without replacement. Next replace the whole sample 
and draw a second sample of m pairs (z;, y;) without replacement. This method 
of sampling makes the two samples of size m independent. (For the procedure 
of splitting at random the sample of size n into two samples of size m, this will 
not be the case.) If n<N, the two samples will usually have no units in common. 
Very occasionally, however, the samples may have one or more units in com- 
mon, and we specify that if the second sample is completely identical with the 
first it must be rejected and a new sample drawn. We may think of the above 
scheme as follows: From the original population of N pairs of (z, y) form a 
superpopulation of the (“) possible samples of size m, and draw two of these 
samples without replacement. If we denote by %:, #1, 71 and Z, J. and 7 the 
sample means of the z,, y; and r; for the two samples, we may form the two 
unbiased ratio estimators 


N-1 
y’ = Yn +( W \(— ae — #,%) 


1 _ Ws e ~_ *)( m )o ie te 
= T —— " 
Y2 Af, + N i J2 — FeX2) 


Hence, 9’ = $(y:’ +42’) is an unbiased estimate of Y, and an unbiased estimate of 
the variance of 7’ is given by 


een 


(y:’ — ye’)?4 — 


Crt 
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The finite sampling corrections [(N —1)/N] and certainly 


Un) -21/ (a) 


can usually be replaced by 1. The unbiased estimate of the variance is based 
only on a “single degree of freedom.” If more “degrees of freedom” are desired, 
K samples of size n/k could, of course, be drawn (K >2). The disadvantage of a 
“single degree of freedom” (i.e., K=2) is eliminated to a certain extent in 
stratified sampling where the number of strata are large. 

This method of estimation may be illustrated by the following example 
where the population size N=400 and X¥=2. Two samples of size m=2 are 
drawn as follows: 














Ist Sample 2nd Sample 
z y r z y r 
1 3 3 4 8 2 
2 6 3 1 2 2 
#=1.5 n=4.5 n=3 #:=2.5 n=5 r=2 


Ignoring finite sampling corrections, we have that y,’=6, y’=4, 7’=5, and 
the estimated variance of 9’ is }(6—4)?=1. 


6. SOME EXAMPLES 


The comparison of the variance of the unbiased estimator y’ and the biased 
estimator 9 is critically dependent on whether the array variance of r decreases 
with increasing z or not in the (z, r) scatter diagram. In Fig. 501, we have set 
out an example of such a scatter diagram. Fig. 501 deals with a sample of 
49 large U. S. cities (see p. 113 in [1]) drawn from a larger population of 
N = 196 cities. For this figure, z;= population size in 1920, y;= population size 
in 1930 and r;=y;/z;=relative population size. Fig. 501 indicates that the 
array variance of r diminishes with increasing z; i.e., the variation in relative 
size is more stable for the larger cities. Although little is proved by this isolated 
example, it is clear that many situations in which ratio estimators are ap- 
propriate are in fact represented by models in which the conditional variance 
of r given z is decreasing with x. Among these models are the following: 


(a) z=size of population 
y=number of population having a certain character (say sickness) 
r=y/z rate (say sickness rate). 


If the binomial distribution applies, Var (r| x)=2z~'r,(1—r.) where r, is the 
array mean of r given z. This variance will normally decrease with increasing z. 


(b) The array variance of y given z, Var (y| 2), is constant. In this case, 
Var (r| x) will be proportional to 1/z?. 

(c) Var (y| z) is proportional to x. In this case, Var (r|z) is proportional 
to 1/z. 
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Fic. 501 U. S. Cities: Relative Population Increase, 1920-1930. 


The following model is an illustration of a case where the conditional variance 
of r given z is constant: 


(d) The case where Var (y| x) is proportional to z? is sometimes one in which 
r is independent of x and the ratio estimator 9 is unbiased. In this case 
Var (r| x) is constant, and Var y’ will usually be larger than Var 9. 
We shall discuss some of these models in more detail in the following section. 
In order to illustrate the methods given in the present paper, we have used 
the sample data for the 49 cities listed in [1] and plotted in Fig. 501. For this 
population of N = 196 cities, both Y as well as X¥ are known. We have Y = 149.8 
and X=116.9. Let us first ope § and y’. Using the known value of X, 
estimates of Y are 


g=2141 y = 152.1. 


(One reason for the poor estimate obtained from 9 is the very unusual growth 
of a city from 2,000 to 50,000 for which r= 25 is “off the graph” in Fig. 1GH.) 
lf N is taken as ~ and if formulas (34) and (35) are used to estimate the 
variances of # and y’ respectively, from the sample of 49 cities, we obtain: 


v(9) = 3248 v(y’) = 113.7. 


Hence, for this kind of data, the estimate y’ is better than 9. 

We shall now compare the ratio estimates y’ and 7. In Section 4.2, we noted 
that for large samples Var # will be larger than Var y’ if and only if the re- 
gression coefficient B of y on z is nearer to R than to Q=Y/X. For the data 
plotted in Fig. 501, we find that j= 144.9, and B=1.2, R=1.8, Q=1.2. Hence, 
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for large samples, Var 7 will be smaller than Var y’. We find in fact that 
v(¥) = 12.7, where the formula for »(¥) is the usual one (see p. 119 in [1]) except 
that the finite sampling correction has been omitted here for the purpose of 
comparison with v(y’)=113.7 computed from equation (35). The formula for 
v(¥) is based on estimating the terms in equation (23), and similarly an estimate 
of v(y’) may be obtained by estimating the terms in equation (24). The esti- 
mate of v(y’) obtained by this method is v(y’) = 112.0, which is rather close to 
the unbiased estimator obtained using equation (35). 

In the previous example, we found that y’ was better than 9, but the variance 
of # was much smaller than that of y’. We shall now consider a numerical exam- 
ple in which y’ is better than both 9 and 7. The production consists of four (z, y) 
pairs: (2, 2), (2, 6), (4, 6), (6, 10), and a random sample of two pairs is drawn. 
In this case, B=1.45, R=1.79, Q=1.71. Since we are dealing with a sample 
size of 2, the large sample formulas would not give good approximations. The 
exact numerical results are presented in the following table, in which we give 
bias and variance for each of the four estimators of Y; viz., 9, 9, y’, and 9. 












































Mean | Bias Vari- Mean Square Error 
ance 
1 1 1 1 5 10 20 50 100 
“3, ‘s 

6.09 09 .909 .917 .189 .098 .053 .026 017 
6.27 .27 2.233 | 2.407 .520 . 297 .185 .118 .096 
6.00 .00 .563 . 563 112 .056 .028 O11 .006 
6.00 00 2.667 | 2.667 .533 . 267 .133 .053 .027 





We see that y’ has a smaller variance and bias than the other estimators. 
Also, if we assume that there are a number of strata (k) that have the same 
(z, y) population values, then the relative advantage of using y’ for each 
stratum, rather than the other estimates, increases as k, the number of strata, 
increases. 


7. SOME SPECIAL CASES 


In Section 4.2, we saw that the large sample variance for y’ is n—! Var (y—Rz), 
and the large sample variance of § ism~' Var (y—Qz). The estimate 9 is consist- 
ent only in the case where R = Q, and the variance of fisn—! Var { (y—Rz)(X/z)}. 
The comparison of 9 with y’ and 9 (when R=Q) will depend on the form of the 
conditional variation E{(y—Rz)*|z} of y around Rz, given z, and the dis- 
tribution of z. We shall study several forms of E { (y—Rz)?| x} or of the condi- 
tional variation F { (r—R)?| x} =g(z) = E{ [(y—Rz)/z]?| x} of r given z. 

We shall also make some assumptions that are often quite reasonable in 
many situations where ratio estimators are appropriate. It has been pointed 
out (see e.g., p. 196 in [4]) that there will be no bias, or only a trivial bias, in 
ratio estimators “when the regression of z and y is reasonably well represented 
by a straight line through the origin.” If the line of relationship between y and 
x departs in a significant way from going through the origin, the variance of 
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the ratio estimators may be large. Because of this possibility of large variance 
and/or bias, the use of ratio estimators seems most appropriate when the line 
of relationship does not depart in a significant way from going through the 
origin. We shall therefore assume in this section that the expected value of y 
given x, E(y| x), may be described by a line going through the origin. In other 
words, we shall restrict our study in this section to the special case where 
E(y| x) =az, where a is an unknown positive constant. In this case, the three 
ratio estimators y’, §, and # are unbiased ([1], p. 123), and the exact formulas 
for the variance and efficiency of these three estimators may be obtained. 

In the case where the form of the conditional variation Z { (r—a)?| x} =9(z) 
of r given x is known, the most efficient linear unbiased estimate of Y is 


y’ = XD [ri/g(ed] / dX W/o) I. 
t==1 t=1 
The exact variance of y’’ is 
Var (y’’) = Xe {1 > [v/oteal (we assume g(z;) > 0). (38) 
t=] 


Since y’’ has a smaller variance than any other linear unbiased estimate (i.e., 
linear function of r; for which the conditional expectation of the function, 


given 21, 22, - - - , Zn, is equal to Y), we may write the formulas for the efficiency 
of the ratio estimators relative to y’’, When n approaches infinity. we have that 
n Var (y”) ~ X?/E{1/g(z)}, (39) 


where E{1/g(x)} is the expected value of 1/9(z) when a single z value is 
drawn at random from the population. 
We shall now determine the exact variance of the ratio estimators. Writing 


, = y 2. cars / 2, %% 
t=} 





x3) = Y, 


n n 2 
Var (5 | zi) = X? » » gtedze /( D.) , (40) 


we have that E(# 


and 
n n 2 
var (5) = ¥| ¥ tadee /( Lax) |, (41) 


where E(e| x) and Var (z| x.) are the conditional mean and variance, respec- 
tively, of z given 2, 2, Zs, -- * , Z., and Var (z) is the unconditional variance 
of z. 

Writing 


I= ¥ Eri/n, 
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we have that E(#| 2) =Y, 


Var (9| 2) = ¥* D g(20)/n, (42) 
and 
Var (9) = X*E[g(zx)]/n. (43) 
Writing 


y' = Xr+ n( > (raxs/n) — is) / (n — 1) 


= > r{(X/n) + [@ — d/n — DI}, 


t=l 


we have that 


Var (y'| 2) = Doed{(R/m) + [—a/m—H]}, (4) 
and 


Var (y’) = XE [o(a)]/n + 8| SX olad(es — 2] / (nm - 0 


(45) 
+ 2¥E| Le ox) (as _ | / nm — 1). 
Writing the sample mean 
9 sa ; ria;/n, 
tol 
we have that 
E[(5 — Y)*| 2] = E{[(9 — a) + (az — Y)}*| 24} 
(46) 
= 2 o(x)x2/n? + a(z — X)?, 
t=] 
and 
Var (9) = {E[g(z)z*] + a? Var (z)}/n, (47) 
where E[z| x;] is the conditional expectation of z given 2, 22, 23, - - - , Zn, and 


E|[g(z)z*] is the expected value of g(x)z? when a single z value is drawn at 
random from the population. 
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When n approaches infinity, we have the following results: 

n Var (j) = n Var (y’) ~ E[g(z)x*] 

Eff(5) ~ Eff(y’) = X*/E[1/g(x) |E[g(z)z*] 

Eff(9) = 1/E[1/g(x)] Eig(x)] 

Eff(3) ~ X*/E[1/g(x)] { Elg(x)x?] + a? Var (z)}, 
where Eff(z) denotes the efficiency of z relative to y’’. Hence when n is large, 
the efficiency of f and y’ are approximately equal. 

Let us now consider some of the special forms of the function g(x) mentioned 


in the previous section; viz., (1) g(x) =b/z?, (2) g(x) =b/z, and (3) g(x) =b. 
In case (1), 


(48) 


Eff(3) ~ Eff(y’) ~ X*/E|z*] 
Eff(9) ~ 1/E[z*]E[1/2*]. 


Hence, Eff($) ~ Eff(y’) ~1/(1+D?), where D is the coefficient of variation of 
z; that is D=+/|E(x*) — X*|/X*. We therefore see that f and y’ are almost 
efficient when the coefficient of variation D of zx is small, but that these esti- 
mators may be inefficient when D is large. For example, when D=1.01 the 
large sample efficiency of 7 and y’ is less than 50%. The large sample efficiency 
of § and y’ will be greater than the efficiency of 9 if and only if X*E[1/z*]>1 
in case (1). 


(49) 





In case (2), 
Eff(3) ~ Eff(y’) ~ 1 (50) 
Eff(9) ~ 1/E[x]E(1/z). 
Hence, the estimators 7 and y’ are efficient in case (2). 
In case (3), 
Eff(5) ~ Eff(y’) ~ X*/E{z?] = 1/(1 + D?), (51) 


Eff(9) ~ 1. 


Hence, the estimator 9 is efficient. The efficiency of § and y’ will depend on the 
size of the coefficient of variation D of z in case (3) as well as in case (1). 

The large sample efficiency of # and y’ will be equal to one if and only if 
X?=E([1/g(x) |E[g(x)z*]. In the case where Var (x) and Var (r/zx) are greater 
than zero, this equality holds if and only if g(x) =b/z (case (2)). In other words, 
f and y’ are efficient only in case (2). This result follows from an application of 
Schwartz’s inequa.ity. 

The large sample efficiency of # will be equal to one if and only if 
E(1/g(x) ]E[g(x) ]=1. In the case where Var (x) and Var (r| x) are greater than 
zero, this equality holds if and only if g(z)=b (case (3)). In other words, 9 
is efficient only in caze (3). 

If the form of the function g(x) is not known, it may be possible to deter- 
mine its approximate form by an examination of the scatter diagram. With 
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this information, it will he possible to approximate the most efficient estimator 
y’’. For example, if an examination of the sample scatter diagram indicates 
that E(y| x) =az and g(x) =b/x (case (2)), then the estimator # or y’ might be 
best. 


8. ON SOME REGRESSION ESTIMATORS 


In the preceding sections, we have examined the precision of three ratio 
estimators. A method for obtaining a large variety of unbiased ratio and re- 
gression estimates has been presented by M. R. Mickey [7]. The unbiased 
regression estimators might be more useful when the line of relationship be- 
tween x and y departs in a significant way from going through the origin. It 
might be worthwhile to study the efficiency of some of the ratio and regression 
estimators obtained in [7]. 

In this section, we shall study only estimators of the form §—(#—X)z, where 
z is a function of the sample pairs (z;, y;) which converges stochastically tc 
some finite value, say Z. For example if z=7, then the estimate reduces to the 
estimator y’ (when n is large), and Z=R. If z=9/2, then 9—(#—X)(9/2) 
= 9X/z= , and the value of Z is Q=Y/X. 

Since z is assumed to converge stochastically to a finite limit, estimators of 
this general form will estimate Y consistently. The limiting distribution of 


vnl[y — (@ — X)z — Y) = valy —-(@ - X)Z-Y -@-2)@- X)] 


will be the same as the distribution of /n[j—(@-X)Z—Y], since 
Vn(z—Z)(—X) converges stochastically to zero. Hence, when n is large, the 
variance of #—(#—X)z is approximately 

n-(o,? + Z%o,? — 2Z Cov yz). (52) 
This variance is minimized when Z=Covy,:/e."=poy?/oz", where Covys=Cys 


and p=Cy./e2y. 
If 


= > yi[x; — #] D ale, — 4], 


then the asymptotic variance of the estimator y’’’ = 7 — (—_X)z is minimized. 
This minimum asymptotic variance is equal to [¢,2—C,.?/c.?]/n. Hence the 
large sample efficiency of the estimator y’ (relative to y’’’) is 


Joy? — Cy2*/o2*)/[oy* + Ro.* — 2RCyz] = (1 — p*)/(1 + F? — 2Fp), (53) 
and the efficiency of 7 is 
[o,® — Cy-*/o.*]/[o,* + Q*e.* — 2QC,2] = (1 — p*)/(1 + H* — 2Hp), (54) 


where H =Qoe,/o, and F = Ra,/a, (see p. 148 in [1]). 

The reader will note that the most efficient estimator, among those con- 
sidered in this section, was the usual regression estimator. Although we did 
not assume in this section that the type of population might be described by a 
regression model (i.e., we did not restrict the discussion to the case where 
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E[y|z]=ax+6), we find that the standard regression estimator has a smaller 
asymptotic variance than other estimators of the form we have considered. 
It can also be seen that there are many unbiased regression estimators that are 
relatively efficient; i.e., unbiased and equivalent to y’’’ when n is large. 

If we do assume that the regression model holds, some further results may be 
obtained. When E [y| z|]=azr+b, then 


Q = Y/X =a+0/X, R = Ely/z] = a + bE[1/z], Ely(x — X)] = aS?. 
The estimator 7 will be more precise than y’ if and only if 
| R — Cyx/o2*| > | Q — Cy2/o2?| (see Section 3.1); (55) 
i.e., if and only if 
| bE[1/x]| >b| X|, 


XE[i/r}>1 (6#0). (66) 


Since this is always the case when )+0 and o,’>0, we see that 7 will always 
be more precise than y’ when b+0. In the preceding section, we considered 
the special case when b=0, and saw that the estimators 7 and y’ had the same 
precision in that case. 

We have seen that when E[y|z]=axr+, the estimator y’ is never more 
precise than 7. This fact may also be obtained using the following somewhat 
more general consideration which may help to throw some more light on the 
relationship between the estimators. In general, we have that 


Q—-R=V/X — E[r] = {¥ — XE[r]}/¥ =c../X, 
where C,, is the covariance between r and z. When E[y|xz]=axr+b, then 
Cre = bE[(x — X)/x] = b{1 — XE[I1/z]}. 


Since XE[1/xr]>1, we see that C,.>0 if and only if b<0. Hence, Q>R if and 
only if b<0. Since b= Y—aX¥ =(Q—a)X, we find that when b<0, then Q<a, 
and a>Q>R. When b>0, then a<Q, and a<Q<R. Since C,:/c.2=a in the 
regression model, we see that the value of R is never nearer to C,-/c,? than is 
the value of Q. In fact, we see that when Q is too large (i.e., larger than the 
optimum value C,-/e.%), then R will be even larger than Q; and if Q is too 
small then R will be even smaller. Hence when n is large, the variance of y’ 
is never nearer to the minimum variance (the variance of y’’’) than is the 
variance of 7. 
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ON NONCOVERAGE OF SAMPLE DWELLINGS* 


Lesuie Kish AND IRENE Hess 
Survey Research Center, University of Michigan 


We present an investigation of the extent of noncoverage that can 
occur in area sample surveys of dwelling units in the United States, and 
of some relevant factors associated with noncoverage. We describe 
changes in field procedures which resulted in drastic improvements in 
dwelling unit coverage. Our noncoverage, formerly somewhere in the 
neighborhood of 10 per cent, was roughly the same magnitude as the 
total of all nonresponses; therefore, its reduction was important. In 
one year we brought our coverage rate to about 97 per cent of the 
number of occupied dwelling units reported by the Census Bureau. 
Thus noncoverage, as a source of errors of nonobservation in our sur- 
veys, has been greatly reduced in magnitude and importance. The po- 
tential bias is shown to be a function of the type of estimation used: 
large for simple expansions, less for means and usually even less for 
comparisons of subclass means. The amount of noncoverage is a func- 
tion of the essential operating conditions used by a specific organiza- 
tion; however, the usefulness of the methods and results—and the na- 
ture of the problem—are quite general to surveys. 


TYPES OF NONSAMPLING ERRORS IN SURVEYS 


T? place the errors of noncoverage in proper perspective we present below a 


brief classification of the types of nonsampling errors that arise in sample 
interview surveys from households in the United States. 


1. Errors of observation consist of the errors due to obtaining and recording 
an observation incorrectly. These may be: response errors or processing 
errors. 

. Errors of nonobservation are due to failures to obtain observations on some 
segment of the survey population: a) Errors of nonresponse include failures 
to obtain an observation (measurement) on an observational unit desig- 
nated for the sample. These may be classified as refusals, not-at-homes, 
incapacitated, lost schedules, ete. Nonresponse may be total or partial. 
b) Errors of noncoverage are due to failures to include in the sampling 
frame some members or some segments of the population as defined for the 
survey. These include various errors made in the selection process, in 
office routine, in field instructions, and in field procedures. The reverse of 
these are errors of over-coverage: including in the sample elements which 
do not belong to the sampling unit or do not belong to the population. 


Failure to include sampling units in the sampling frame results in a zero prob- 
ability of selection and in the exclusion of these units from the survey results. 
However, in this subclass of noncoverage we do not refer to any segments of 
some larger population that have been deliberately and explicitly excluded from 
the survey population. This kind of exclusion arises either from considerations 





* Some of this was presented at the Annual Meeting of the American Statistical Association in Atlantic City on 
September 13, 1957. 
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of the survey objectives or because of practical difficulties. For example, in 
surveys of attitudes of the United States population, persons under 21 years 
of age are often excluded explicitly and deliberately; residents of institutions of 
various kinds are often excluded because of practical difficulties. This kind of 
explicit exclusion differs both in intent and in result from noncoverag* caused 
by failures in the procedures. The former are part of the stated exnlicit “essen- 
sential conditions of the survey.”' If they are identifiable it is sometimes possible 
to estimate their effects. 

We separate noncoverage from nonresponse because of the great practical 
differences in the methods for and difficulties in measurement. The extent of 
nonresponse can usually be estimated well and simply from checks of the sur- 
vey returns against the sampling units designated in the selection procedures; 
and it is common these days to find the percentage of nonresponse attached to 
results of probability samples. 

However, noncoverage is difficult to estimate in many survey situations, and 
coverage estimates for area samples of dwelling units are scarce. The extent of 
noncoverage in these situations can be estimated only against some check ob- 
tained outside the survey procedure itself. This check can come either from an 
auxiliary investigation attached to the survey, or from data obtained from 
independent sources. In either case, the task of checking is an arduous task, 
seldom performed.? 

From the scarcity of literature on noncoverage, it would be wrong to con- 
clude that it is not an important source of survey errors. It is true that there 
are many sampling situations where the existence of a clear, convenient, and 
(nearly) complete frame will allow the sampler to keep noncoverage low or to 
estimate it easily and precisely. For example, it may be easy to sample the 
employees of a firm, the telephone subscribers of a city, or the students of a 
school. The national registers of some European countries provide easy and 
relatively complete frames of their populations. In these situations the estima- 
tion of noncoverage may also be relatively simple. 

This article investigates a difficult and important practical problem: obtain- 
ing coverage of the dwelling units of the United States by means of an area 
sample. The task involves some form of listing of the dwellings by the inter- 
viewer. Practitioners agree that this job is a difficult one to do well—that is, to 
a close approximation of complete coverage. Even the United States Census of 
1950 missed 2 or 3 per cent of the dwellings (see reference in footnote 2) in 
spite of its excellent organization and its concentration on complete coverage. 


PROBLEMS OF ESTIMATING THE NONCOVERAGE 


The estimation of errors of noncoverage generally entails one of two difficult 
alternatives. The first calls for a quality check by procedures which are suffi- 





1 Terms such as “essential conditions of a survey” or “true value” have been defined with some (but incom- 
plete) uniformity in several fine texts on survey sampling. See the volumes by Cochran, Deming, Sukhatme and 
Yates; also Morris H. Hansen, William N. Hurwits, William G. Madow Sample Survey Methods and Theories, 
John Wiley and Sons, New York, 1953, Vol. II, Chap. 12. 

* Both kinds of checks can be illustrated by investigations on the noncoverage of the 1950 Census of the United 
States population. The remarkable auxiliary project of the Census Bureau is summarized by A. Ross Eckler in 

“Extent and character of errors in the 1950 census,” The American Stat D ber, 1953. An investigation 
against independent sources is: “The population of the United States in 1950 classified by age, sex, and color—a 
revision of census figures,” Ansley J. Coale, Journal of the American Statistical Association, March, 1955, pp. 16-54. 
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ciently better to provide the “true value” against which the survey results can 
be compared. The quality check is necessarily expensive, both because the 
separate and better procedure is generally costly, and because the check has 
to be on a large enough scale to measure errors which are generally rare and 
“spotty.” The alternative procedure calls for a reliable estimate from an outside 
source. One difficult condition is that the outside estimate of the total must be 
based on the same units not only by formal definition, but also operationally ; 
the differences in the two definitions must be small enough to allow for mean- 
ingful comparisons between the two sources. 

The Census Bureau provides annual estimates of the number of households, 
equivalent to the number of occupied dwelling units, in the United States.* It 
is against the Census Bureau’s national estimates that comparisons in this 
paper are made. Our technique consists basically in comparing with the Census 
national estimate the occupied dwellings in the Survey Research Center 
(SRC) samples multiplied by the inverse of the over-all sampling fraction 
(probability).* Hence, the amount of noncoverage in the Census estimates is 
additive to our noncoverage estimates. These, moreover, are for the net non- 
coverage hiding whatever overcoverage may exist in them. 

In the light of the operating conditions, it seems to us that occupied dwelling 
units provide the most practical basis for comparisons of our surveys with cur- 
rent data from the Census Bureau. Comparisons based on persons seem some- 
what impractical both because these are not collected uniformly in our surveys, 
which deal with many different populations, and because of the difference in the 
bases of the two sources. In particular, the population of total persons in the 
United States is larger than the population living in dwelling units to which 
our samples are confined—larger by an appreciable number, which is difficult 
to estimate. 

While occupied dwelling units provide a useful basis for our comparisons, 
that basis is not without faults and doubts—because of possible differences in 
actual operations despite the formal similarity of definitions. Although esti- 
mates of the over-all coverage are useful, they are not entirely precise; they 
are subject both to some bias of measurement and to sampling errors. The raw 
figures of our investigation for the years 1950-1955 yield estimates of non- 
coverage somewhere between 11 and 13 per cent. However, after some adjust- 
ments, our best guess is that noncoverage during the period of our “old meth- 
ods” of block listings was about 10 per cent. With present procedures we esti- 
mate that noncoverage for the past two years has been about 3 per cent. 
Furthermore, with new controls and with wider use of changed field procedures 
we may be able to reduce this figure still further. 

The improvement during the last two years was beyond our fondest ex- 
pectations. For survey practitioners it is a rare experience to reduce so drasti- 
cally an important source of errors, and to find new procedures which are so 





* U.S. Bureau of the Census, Current Population Reports: Series P-20, Nos. 68, 75, 76, “H« holds and Famili 
by Type.” 

4 We wonder if now is the time for: a) A U.N. Statistical Sub-Commission report to standardize terms and 
recommend usage in the area of non sampling errors similar to the excellent and effective “The Preparation of 
Sampling Survey Reports,” U.N. Statistical Paper—Series C, No. 1; and b) reports of probability samples to include 
estimates of noncoverage along with nonresponse. 
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much more effective without considerably increasing costs. Moreover, the sud- 
den change came after years of attempts at improvement with changes of in- 
structions—attempts which looked good on paper but failed in the field. These 
considerations prompt us to share our experiences in the expectation that some 
of the problems are common to survey organizations in spite of differences 
among them. 

Beyond an over-all estimate of noncoverage we undertook a painstaking in- 
vestigation of its major components in different parts of the population. The 
large differences in noncoverage provide clues to some of the possible effects of 
noncoverage on survey estimates, especially for allocation of efforts, in the 
office and in the field, for the elimination of the major sources of noncoverage. 
The analysis consists of two parts, which are distinguished by the nature of the 
unit (block or place), used in the analysis. 


BLOCK ANALYSIS 
The block analysis consists of 629 blocks in 50 cities of over 50,000 popula- 
tion, for which block data were available from the 1950 Census (“block cities”). 
The basic estimate of coverage consisted of the ratio for each sample block: 


Addresses listed by SRC interviewers X Dwelling units per sample address 
Dwelling units in the 1950 Census X Growth factor for the city 


The addresses listed within the sample blocks needed two kinds of corrections. 
First, although the interviewer is asked to list dwelling units, he cannot do this 
perfectly by observing the buildings from the outside. He actually enters on 
the listing sheet visible listing units which we call “sample addresses.” Some 
additions and deletions take place during interviewing when the interviewer 
obtains more information at the sample address. The net correction factor of 
dwelling units per sample address averaged about 1.04, resulting from four 
classes of discrepancies: additional dwelling units at a sample address, sample 
addresses missed during the listing operation and added by means of the “half- 
open interval” technique during interviewing, newly constructed dwellings, 
and listed sample addresses which were found at the time of interviewing to 
contain no dwellings. The first three classes were added and the last class sub- 
tracted to translate sample addresses into dwelling units. These correction 
factors were applied in several groups of cities classified by size and region. 

Secondly, the growth factor was used to adjust for the average growth of the 
city between 1950 and the date of the listing—mostly within a year or two, and 
not later than 1954. The growth factor took into consideration the rate of 
growth of the city between 1940 and 1950 (based on Census data) and the rate 
of growth since 1950 as estimated from building permit data. 

Unfortunately these correction factors are far from precise. Hence, the over- 
all coverage figure is listed as 96 per cent in Table 513, while it should be 
nearer 90 per cent. This lack of precision also may affect somewhat the com- 
parisons between the various subclasses in Table 513; nevertheless the larger 
differences found there are real, we believe.® 








*a) Most of our data originate within a year or two of the Census of 1950; hence, the uncertainties due to 
changes are rather confined. b) We computed a standard error of about s =0.4 per block for these ratios. The simple 
formula of s/+/n, where n is the number of blocks for an estimate based on about 70 blocks points to a standard 
error of 0.05, and for 200 blocks a standard error of about 0.03. 
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The sample blocks were selected with probabilities proportionate to Census 
size, and the over-all sampling fraction was determined so as to yield a self- 
weighting sample of dwelling units for the United States. Hence, the analysis 
of these blocks reflects the condition in the population of “block cities.” 

Some of our results are presented in Table 513; the 629 blocks were classified 
in four different ways denoted, respectively, by Sections A, B, C and D. 

The coverage is higher in the blocks with higher economic ratings. The dif- 
ferences are not small: roughly six per cent higher in the “medium” and 12 per 
cent higher in the “high” group than in the “low” group. The coverage data are 
given first for the blocks classified (A) on the basis of mean rental or property 
value, as reported in the Census Block Statistics bulletins. The second classifica- 


TABLE 513 


COVERAGE RATES* BY CHARACTERISTICS OF THE BLOCK 
(Numbers in parentheses give the number of blocks on which the rate is based) 








City Size (in Thousands) 





Block Characteristics 
Total 50-99 100-249 250 and over 





All Blocks 0.96 (629) 0.92 (154) 0.94 (114) 0.99 (361) 





A) Census Average of Property Value or 
Rental 
High 1.07 (92) 1.04 (14) 0.94 (21) 1.13 (57) 
Medium 1.00 (308) 0.96 (82) 0.92 (48) 1.03 (178) 
Low : 0.93 (229) 0.88 (58) 0.94 (45) 0.94 (126) 





B) Average of Economic Ratings Assigned 
by SRC Lister 
High : 1.05 (117) 1.18 (22) 1.05 (26) 1.00 (69) 
Medium 0.96 (283) 0.90 (56) 0.91 (48) 0.99 (179) 
Low 0.94 (229) 0.89 (76) 0.92 (40) 0.99 (113) 





C) Prevalent Type of Structures in Block 
Single dwellings 0.93 (312) 0.88 (104) 0.90 (70) 1.00 (138) 
2 to 7 dwellings 0.95 (46) 0.94 (24) 0.97 (22) 
Apts. above stores 0.94 (56) 0.91 (16) 0.96 (40) 
Apartment houses 1.04 (70) — a — 
Mixtures of several classes 0.99 (145) 1.06 (24) 0.94 (25) 1.00 (96) 





D) Nature of Block Boundaries 
Simple rectangle or polygon 0.95 (400) 0.92 (84) 0.91 (65) 0.98 (251) 
Composite block (combination of 

two or more blo*ks) 0.96 (157) 0.94 (42) 0.98 (36) 0.96 (79) 
Irregular shape and/or large area 1.00 (72) 0.89 (28) 0.94 (13) 1.16 (31) 

















< Addresses listed by SRC interviewers X Dwelling units per sample address 
Dwelling units in the 1950 Census XGrowth rate for city 


(Note: Coverage ratios are subject to an estimation bias and to sampling error. The figures are to be used solely 
for comparisons and not as actual measures of coverage.) 





tion (B) is based on examination of the economic ratings assigned by the lister 
to the individual dwellings.* 

The coverage was better in the largest cities than in those of moderate size. 
Poor coverage comes from the moderate-sized cities, and there from the “me- 
dium” and especially from the “low” blocks. 





* To appraise these effects on the entire sample, it must be pointed out that the size of each stratum is not 
proportional to the number of blocks: the rates of selection of the high, medium, and low economic blocks were 
in the proportion of 4:2:1 in the earlier period and 2:2:1 later. The strata as they stand represent about 10 per cent, 
40 per cent and 50 per cent, respectively, of all dwelling unite in cities of 50,000 and over. 
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The blocks were also classified in the office according to the prevailing type 
of structures in the block (C). As expected, coverage was poor in blocks where 
small multi-dwelling structures of two to seven dwelling units predominate, 
or where many apartments are located above stores. It seems that coverage is 
very good in blocks containing chiefly apartment houses—probably because of 
the systematic design of these larger buildings. On the other hand, we were 
surprised by the poor coverage in blocks where the listing showed chiefly single 
dwellings; perhaps these blocks contain many scattered back entrances and 
second tioor apartments which remain unnoticed by the lister. 

The nature of the block boundaries (D) was of interest to us. Some people 
supposed that much of our trouble came from blocks that were large and ir- 
regularly shaped, where the interviewers had special difficulties in finding all 
of the dwellings. We found no evidence for that supposition. The “composite” 
block consists of two or more adjacent blocks grouped to provide a block of 
“sufficient” measure of size; these blocks also failed to reveal any unusually 
large noncoverage rates. It seems that the majority of missed dwellings come 
from ordinary-sized, well-defined regular blocks, in moderate-sized cities, which 
appear to the lister to be filled with ordinary single-dwelling homes. 


ANALYSIS BY PLACE 


The analysis of places (cities, towns and villages) was based on two types of 
data. In the “block cities” the coverage ratios of the individual blocks (de- 
scribed above) were averaged. For the smaller places, where the sample blocks 
were selected systematically from a numbered map of all blocks, the coverage 
ratio was computed as: 

DU’s listed in sample blocks Total number of blocks in town 


DU’s in 1950 Census Number of sample blocks in town 


The coverage ratio for each place was corrected for the factors of listing 
(dwellings per sample address) and of growth. The 113 places (cities and 
towns) which enter into the analysis are not our entire sample of places for that 
period, but we believe the conclusions are not thereby vitiated. They are all 
those for which meaningful comparison could be made; for other places either 
some data were lacking or the differences in city boundaries made comparison 
hazardous. 

In Table 515 we may observe appreciable differences in coverage of the differ- 
ent size classes. The coverage figures for the eight size classes are far from pre- 
cise; nevertheless, they suggest that medium-sized cities, those from 10,000 to 
250,000 population, had the poorest coverage. For this conclusion there is sup- 
porting evidence from checks by the 1950 Census Post Enumeration Survey 
enumerators of SRC listings. The percentage coverage showed similar variation 
in population classes, with lowest coverage for the medium-sized cities. Those 
coverage rates were: towns under 2,500, 99 per cent; cities 2,500 to 50,000, 94 
per cent; cities 50,000 to 500,000, 88 per cent; cities 500,000 and over, 96 per 
cent.’ (See footnote 20.) 








7 In addition to the average coverage rates we included in Table 515 another measure for comparing the cover- 
ages in different size classes of cities: the proportion of cities in each size class where the coverage rates fell below 1.00. 
This is seen to be highest in the medium-sized cities. 

The standard deviations within sise classes is about 0.12 per city (on the average) for cities over 50,000 and 
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TABLE 515 
COVERAGE RATES BY SIZE OF PLACE (CITY OR TOWN) 








1950 Population (in thousands) 





1.0-| 2.6- |; 10 | 25 | 50 | 100- 
2.4 | 9.9 24 49 99 249 





Coverage Rates 1.03 | 1.10 | 0.94 é 0.94 | 0.92 


Standard Error 0.06 | 0.07 | 0.08 . 0.04 | 0.02 
Number of Places 10 32 13 11 


Proportion of Places with 
coverage less than 1.00 0.40 | 0.31 | 0.62 





























Note: Coverage rates, etc.—same as under Table 513. 


The poor performance in the medium-sized cities came as an unexpected 
result—which we later put to good use. Post hoc, we reason that the cities of 
medium size are those in which irregular crowding into small multi-unit struc- 
tures (two to seven dwellings) is most prevalent; and this type of structure is 
the hardest to list well (see Table 513). This situation is in contrast with more 
widely spaced, one-family homes in small towns, and with the well regulated 
character of apartment houses and of large real estate developments in the 
larger cities. 

Is the poor coverage in medium-sized cities due chiefly to differences in 
growth rate? This supposition was contradicted by further analysis. But there 
is an interesting relationship between growth rates and coverage (Table 516)! 
The coverage is poor in both the slowest and fastest growing cities, with best 
coverage in cities with moderate growth of 0.01 to 0.02 annually. We investi- 
gated this problem only in cities of 50,000 and over; in the others the data on 
growth seemed much too unreliable for testing. Perhaps the slowly growing old 
cities have a larger proportion of low-income areas, and in the very fast growing 
cities obstacles are more often found in the form of conversions of old, large 
houses into multiple dwellings. 

There were two results which tend to show that coverage can be influenced 
strongly by variations in the nature of training and supervision. First, there 
seemed to be about 5 per cent coverage variation in the listings of different 
years. Unfortunately, any possible differences of emphasis in training for listing 
were unplanned, indefinite, and forgotten; hence they could not be correlated 
with the results. Secondly, a surprising difference was found between the two 
sets of primary sampling units (psu’s) into which our national sample of 66 
psu’s has been divided. The “A” half-sample was used considerably more fre- 
quently, during the years in question, than the “B” half-sample. Hence, the 
interviewers in the B half-sample had less experience. The results were surpris- 
ing: the B half-sample had a considerably better coverage, by over 5 per cent. 





about 0.34 for the smaller places. The computed standard error is presented for each group. The smaller variation 
for the larger cities is mostly due to the selection of blocks with probebility proportional to Census size which 
reduces considerably the variation in sample size per block. 
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TABLE 516 


COVERAGE RATES BY AVERAGE GROWTH RATE OF CITY 
Cities of 50,000 or more population in 1950 








Average Annual Growth Rate 





Item 


Under 0.010) 0.010—.014 |0.015—0 .019/0 .020—-0 .029|0.030 & Over 





Coverage Rate 0.88 0.93 0.94 0.91 0.85 
Number of places 12 10 8 11 7 

















Apparently the interviewer is at first careful about the listing instructions he 
just learned, but becomes careless later.*® 

Examination of the coverage ratios for individual cities disclosed considerable 
variation from place to place; about half of the noncoverage was attributable 
to a relatively few places. Some places are more difficult to list than others. 
Nevertheless, we suspect that a small proportion of the field workers commit 
a large proportion of the errors, while the majority do a much better job.° 


LIMITATIONS OF THESE FINDINGS 


These results deal chiefly with the problems of noncoverage of a procedure 
using “block listings” of dwellings in sample blocks with subsequent sub- 
sampling. All this was done by interviewers in a widespread national sample. 
This is an important practical sampling problem; but how relevant are these 
results to other sampling situations? How can our experiences be compared 
with those of other organizations? Any projection of our experiences to another 
situation should give consideration to the following: 


A. Other frames will result in different coverage. From a clear, convenient, 
and complete frame it is rather easy to obtain good coverage. 

B. For a single city or county it is generally easier to obtain good coverage. 
Immediate and direct supervision and checking of the field work permits 
easier and tighter control. An example of this is provided by the Detroit 
Area Study, a sample of dwellings of the city of Detroit plus its surround- 
ing Census-tracted area. The dwellings in about 250 sample blocks were 
listed by graduate students in the social sciences.’ In five yearly surveys, 
1952-1956, based on three separate listings, the coverage rate was esti- 
mated between 95 per cent and 98 per cent. 

. Entire segments completely enumerated will lead to good coverage more 
easily than subsampling of listed segments. This is believed by some 
experienced samplers, and our experience seems to bear this out. 





* The Census Bureau also found the old interviewers for the Current Population Survey doim; a poorer job, 
on questions of the labor force, than the new interviewers. See Morris H. Hansen, William N. Hurwitz, Harold 
Nisselson, and Joseph Steinberg, “The Redesign of the Census Current Population Survey,” Journal of the American 
Statistical Association, Sept. 1955, pp. 715-6. 

* Similarly in the Current Population Surveys: “As can be seen, a relatively small proportion of the inter- 
viewers made a large proportion of the errors in coverage.” Morris Hansen and Joseph Steinberg, “Control of Error 
in Surveys,” Biometrics (1956), XII, pp. 467-468. 

wR. Freedman, “The Detroit Area Stuay: training and research laboratory in the community,” American 
Journal of Sociology, 59, July 1953, pp. 30-33. 
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D. A thorough pre-enumeration of the entire segment or block before sub- 
sampling should produce good coverage. This will involve knocking at 
most doors and conducting a short pre-interview. This will prove a fruitful 
procedure in some situations, particularly those where the results of the 
pre-interview are used for a “double sampling” procedure. However, as a 
general practice, it has two disadvantages: it is expensive, and some ex- 
perts believe that it may build up resistance to or bias in the interview. 

. A half-open interval procedure was tried for several years. The interviewer 
was asked to include in the sample not only the sample dwelling but any 
other dwelling found up to, but not including, the next listed address. 
The interview schedule for the sample address had two questions dealing 
first with additional dwellings at the sample address and, secondly, “up 
to the next listed” address. Similarly fruitless were two attempts to sup- 
plement with special searches the listings of selected blocks. These pro- 
cedures for obtaining good coverage look fine on paper, but our field at- . 
tempts picked up only a per cent or two of the large noncoverage we 
knew we had. The failure of our attempts is explained by field experts 
in terms of the overriding preocccupation of the interviewer with obtain- 
ing a good response at the sample address. Perhaps better control of and 
more emphasis on these procedures could eliminate most of the noncover- 
age. 

. Could a different set of written instructions have increased the coverage 
substantially? We doubt it. We have worried about noncoverage and re- 
written instructions hopefully for eight years only to find that non cover- 
age remained at about the same level, year after year. 

. Could different training, supervision, and control produce better coverage 
with essentially the same listing methods? This question is aimed at more 
thorough changes in the entire field training and control than the changes 
in written instructions discussed in E and F. We do not know the answer. 
Our field force enjoys an enviable reputation. We keep many forms for 
painstaking control over field procedures which compare favorably with 
those of most organizations. However, the chief emphasis in the field 
training and supervision is on the psychological problems of obtaining 
good cooperation and accurate response from the respondent. Would it 
be possible to maintain these interviewing skills at their present high 
level, but at the same time give more attention to the tasks of complete 
enumeration? Could we have made the same listing methods succeed 
with more control, continuous quality checks, periodic retraining? We 
do not know. We decided instead to try some drastic changes in pro- 
cedures; and these proved so successful that we intend to continue to use 
them. 


CHANGED PROCEDURES 


A few words about our organization may lead to a better understanding of 
our choices of sampling techniques. In order to meet the demand for a constant 
flow of samples to the field, periodically we draw larger samples which can be 
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subsampled on relatively short notice for particular studies. We draw five to 
ten subsamples from each of these larger samples, and in the course of a year or 
two, we largely exhaust them. The larger samples are redrawn in individual 
places as they become exhausted. The ratio of the smaller to the larger samples 
is geared for efficiency in three ways: a) the specialized tasks of sampling are 
done relatively infrequently and in convenient “slack” periods; b) the entire 
sample is usually exhausted finally, hence its cost is distributed over several 
(five to ten) surveys; c) the sample is made to last only a year or two so that 
it does not become too obsolete. These procedures are adjusted to the nature 
and size of our organization. Consequently, some of our techniques, which give 
us substantial benefits, may be inefficient for one-time samples and for other 
organizations. We use three procedures now: 
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A. Sampling from city directories is now our preferred procedure in all cities 
for which an up-to-date directory is available. An area sample in a sub- 
sample of cities supplements the directory addresses. 

B. A sample of segments, averaging about four dwelling units, is used in 
rural areas and in some smaller urban areas. We are gradually introducing 
this procedure in the larger cities also. 

C. For some cities, including the largest, directories are not available and we 
have continued to use our existing prelistings, supplementing these with 
the half-open interval." We are now changing these places to one of the 
other two procedures. 


The over-all effectiveness of our present procedures, measured” by the aver- 
age coverage rate over several studies conducted recently, was 97 per cent. We 
consider this strong evidence of significant improvement in our coverage of 
occupied dwelling units.“ Because of the consistently high rate over several 
studies, the change can be attributed to changed procedures rather than merely 
to improved performance of the interviewers. This inference is justifiable in 
view of the fact that through eight years of efforts to improve our old pro- 
cedures—including the use of the half-open interval—our coverage remained at 
or below 90 per cent. We prepared rough estimates of coverage for our three 
procedures; these are subject to substantial errors because of lack of data for 
comparisons. The coverage rates appear to be roughly: 90 per cent with block 
listings, 95 per cent with city directories, and very close to complete coverage 
in segments. The 95 per cent coverage in places where we use city directory 
samples is rather satisfactory in view of the fact that these include chiefly the 
medium-sized cities where formerly the coverage was particularly low (Table 
515), about 85 per cent. 

It is one advantage of probability sampling over judgment sampling that 
its errors can often be measured and sometimes corrected or reduced. Moreover, 





u In a current study we are replacing the half-open interval technique with an area sample which parallels 
the city directory block supplement in design and field operation. The results of this experiment are not yet avail- 
able. Somewhat similar efforts have failed in the past. 

2 This is done by paring our estimates of pied dwellings with the Census estimates. See U.S. Census: 
Current Population Reports, Series P-20, No. 68; “Households and families by type,” Nos. 68, 75, 76. The non- 
coverage in Census estimates, 1 to 3 per cent, should be added to ours, for an over-all estimate. 

8 It is difficult to do much better. The Current Population Surveys have outstanding controls and performance; 
they report a 2 to 4 per cent noncoverage (ref. in footnote 9, p. 473). 
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its improvements (like other scientific progress) tend to be permanent, cumula- 
tive, and communicable. We recognize that our coverage estimates will be sub- 
ject to sampling fluctuations; that consistent high coverage will depend on 
continued checks and vigilance; and that our experiences need to be reinter- 
preted before they are applied to other situations. But we believe that we have 
established solid bases for continued high coverage rates for our organization. 


SAMPLING FROM CITY DIRECTORIES 


We are now using city directories as sampling frames in about 60 cities of 
roughly 3,500 to 1,000,000 population, and they represent about 32 per cent of 
our national sample. Our procedure" has two key features to justify the use of 
directories for probability samples. First, our samples consist of addresses 
drawn from the street guide section of the directories and not from the alphabetical 
list of names in the directories. Households are both more mobile and more 
changeable than dwellings. (One-fifth of the United States households move 
each year; and the composition of households is subject to many changes as 
well as to problems of identification.) Secondly, the sample of city directory 
addresses is supplemented by an area sample of dwelling units missing from 
the directory, (either missed by the city directory lister or built since the di- 
rectory listing), so that these too have their proper probabilities of selection. 

Neglect of the two key features will result in poor coverage; in a sample se- 
lected from householders’ names in the Champaign, Illinois, city directory, 
Ferber found a noncoverage of about 35 per cent, plus an unknown noncoverage 
due to new construction.“ However, proper procedures to obtain directory 
samples of cities have been practiced sporadically for decades. We have used 
these procedures successfully for several years with some care for the details 
but without great difficulty. Woolsey” describes procedures which, if followed 
carefully, should enable one to obtain a rather good sample of a single city. 

However, to use directories for national samples, more detailed and explicit 
procedures are needed. When dealing by mail with far-removed interviewers, 
provision must be made for anticipating and correcting problems and excep- 
tions. We have designed, used and tested such procedures. 

Into these general features we introduced two modifications for improving 
efficiency and quality by taking advantage of our continuous survey opera- 
tions. First, instead of individual addresses we select clusters of 5, 10, or 20 lines 
from a sample of pages. The sample of lines is converted into a sample of 
addresses by eliminating blank lines, and by noting the number of different 
surnames at each residential address. Then this list of addresses is visited by an 
interviewer to identify nonresidential addresses and addresses outside the sur- 
vey boundary, to purify and clarify the list, and to assign economic ratings to 
dwellings (for use in disproportionate sampling). For the most part the list now 





“4 Two alternate procedures were tried and found to be impractical. These were: 1) revision of interviewers’ 
listings through a checking operation with the city directories; 2) transcription of city directory listings for entire 
blocks, then sent to our interviewers for checking and revision. We also considered and rejected proposals to con- 
centrate the address sample and the area supplement in the same blocks. 

Robert Ferber, “The Effect of Respondent Ignorance on Survey Results,” Journal of the American Statistical 
Association, 51 (December, 1956), p. 578. 

% Theodore D. Woolsey, “Sampling Methods for a Small Household Survey,” Monograph No. 40, (U.S. Public 
Health Service) (Washington, 1956). 
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contains well identified dwellings (few corrections remain to be made at the 
time of interviewing); from it, samples of addresses are selected in clusters of 
desired sizes. Thus, essentially, our sample is selected in two phases. The first 
phase sample supplies addresses for five to ten samples drawn, in the second 
phase, over a period of 10 to 15 months. Thus, our samples are based on up-to- 
date “listings” prepared generally within a few months of their use. 

Secondly, the area supplement is selected in two or three phases. We concen- 
trate the supplement in a stratified subsample of about a fifth of the eligible 
cities to reduce the necessary effort for what is a relatively small yield. In these 
cities sample blocks are selected in groups of two or four blocks in larger cities, 
singly in small towns. The interviewer visits and reports briefly on these blocks 
and on that basis a subsample of blocks is drawn for each study. For the se- 
lected blocks a list of all directory addresses is transcribed. The interviewer 
looks for omitted addresses and interviews at a designated subsample. We find 
that the block supplement contributes about 6 per cent of the total directory 
sample. The directory did not “miss” all these dwellings; some are newly built 
and some merely represent cases where the directory listed them under dif- 
ferent addresses. We found it practical to include these also in the supplement. 

Our sampling with directories—with block supplements—is satisfactory, and 
a great improvement over our previous listing procedures in medium-sized 
cities, where our listings were poorest. It has two important advantages. First, 
each address is covered by two operations: listing for the directory, and a check 
by our field staff; the radical difference between the two procedures reduces 
the likelihood of noncoverage. Secondly, sampling from directories is generally 
no more expensive than a good listing—it may be somewhat cheaper. 


THE SAMPLE OF SEGMENTS 


There is nothing new about the idea of selecting small, compact segments to 
be completely enumerated on a specified survey.'7? One needs compact seg- 
ments with good boundaries, but with few dwellings. This is often difficult; in 
some situations prelisting and subsampling are needed to arrive at a “segment” 
which is not compact but composed of an average of about four dwelling units 
selected from a listing of a larger number of dwelling units. We shall describe 
briefly the more interesting features of our segment procedures. 

For more than two years we have used segments averaging about four dwel- 
ling units each in the rural areas and in a few urban places in our psu’s. Seg- 
ments are selected in two phases. First we select “chunks” with a measure from 
about 20 to 30 dwellings, depending upon the particular area. It is easier to 
achieve definable boundaries for clusters of 30 dwellings than for segments; 
but sometimes even this requires the help of our interviewers, who provide us 
with detailed information not available from maps. Each sample chunk is 
visited in a preliminary field operation to divide it into segments and to supply 
dwelling unit estimates. Thus our map information is brought up to date and 





17 The arguments for compact segments are made very clearly in W. Edwards Deming, On the Possible Types 
of Sampling Unit in the Last Stage of Sele:tion in a Probability Sample, New York: Advertising Research Foundation, 
1955. Among samples using compact segments are the current Population Surveys of the U.S. Census Bureau and 
the Labor Force Surveys of the Canadian Bureau of Statistics, 
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we protect ourselves against “surprises” in the form of large-scale new con- 
struction (or large trailer camps) at the time of interviewing. This reduces the 
effect of variations in cluster sizes on a single study. Through subsampling the 
effects of “variation” are spread over a number of studies. 

In the second phase, all dwelling units within the selected segments are listed 
and located on the sketch of the chunk. In addition to providing the inter- 
viewer with a county map on which each chunk is outlined, we prepare a de- 
tailed sketch of each chunk showing all segments within the chunk. Duplicate 
folders are prepared with a sketch mounted in each. One folder is kept by the 
interviewer; the other travels from the office to the interviewer and back to the 
office for control checking each time a segment is used for interviewing. 

The two phase procedure has several advantages over a direct selection of 
scattered segments. Because the interviewer returns frequently to the same 
localities, it reduces chances for coverage errors. The sampling costs are spread 
over several surveys because virtually every segment in a chunk is selected for 
a study within the span of a year of two. It reduces variation in sample size, 
improves coverage (with the use of checks) and reduces the interviewers’ prob- 
lems at the time of interviewing. 


ACCOUNTING AND CONTROL OF SAMPLE SIZE 


The confusing discrepancies between sample addresses, dwelling units and 
respondents can be handled well only with thorough procedures. We developed 
a set of standard codes and a procedure for standard tabulations which are 
prepared after the completion of the field work for each study. From these we 
derive many useful estimates, the most important being: 


we occupied dwellings found 





a) Coverage rate 


occupied dwellings expected 


interviews 





b) Response rate = -— . 
eligible respondents 


This information also allows us to plan the size of future studies more accu- 
rately. Furthermore, since information is available for individual places, the 
performance of one or two interviewers in a place can be followed and con- 
trolled better. During the interviewing period a continuous flow of completed 
questionnaires and control forms come into the central office where they are 
compared with a master control list for the study. Any questionable deviations 
from the master control are discussed with the interviewers, preferably while 
there is time to correct the field work. 


EFFECTS OF NONCOVERAGE 


It is useful to present a rough idea of the likely effect that the noncoverage 
has on some types of survey statistics. (The model is also relevant for non- 
responses.) Denote by Y= Y/N=W1.i¥1+W32Y:2 the mean of some character- 
istic y of the population, where N is the number of population elements; W; and 
W; are respectively the proportions of coverage and noncoverage in the sample 
and W,+W:2=1; also Y; and FY: are respectively, the means of the character- 
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istics in the two segments. Let us look at the relative bias (R.B.) in three im- 
portant types of survey estimates. 


a) The R.B. in estimating Y, the population total, by simple expansion is: 
N(Y —_ WiY,)/Y = N(W:2Y:)/NY = W:Y2/Y = W:Z, 


Often Y:+Y and the R.B. is about the size of the noncoverage W:; how- 
ever, in situations where Y;+0 the R.B. becomes negligible. 
b) The R.B. in estimating the population mean F is: 


(Y aey Y,)/Y = Wi(T2 — Y,)/Y = W2Z, 


If Y.+Y, the R.B. is negligible. For the bias to be important there must 
exist a coincidence of large noncoverage and large differences between the 
means of the two segments. 

c) The difference (Y4—Y,) between the means of two subclasses (A and B) of 
the population is subject to the R.B.: 


{(W2(¥: — Yi) la — [W2(¥s — Yi) Ja} /(Vu — Ys) 
= W.'{(¥: — Vida — (¥: — Vida} /(Vu — Yu) = WZ. 


For the bias to be important in the comparisons of subclasses (such as age 
or occupation classes) the effect of the bias on one subclass must be much 
different than on the other subclass. (This assumes that the noncoverage 
in the two subclasses is about the same: We’ = W2,4 = W2z). 


The importance of a bias should be judged in relation to the other errors of 
the survey. Since nonresponse in our surveys is in the neighborhood of 10 per 
cent, the reduction of noncoverage from 10 to 3 per cent was important, but 
further reductions, while not negligible, are less so. Leaving aside the errors of 
response and of nonresponse, let us look at the relationship of noncoverage to 
the sampling error. This problem is broad and we shall have to reduce it to fit 
it into the modest ambitions of this paper. A simplified model can still be useful. 

The sampling error can often be expressed in terms of the coefficient of varia- 
tion (the ratio of the standard error to the estimate), denoted by V=C/4/n. 
Here “C” is the coefficient of variation per element and n is the number of ele- 
ments in a simple random sample. (Complex designs can be expressed in terms 
of “equivalent simple random size.”) The relative bias can be expressed as 
R.B. =W2Z; here Z is a generalized bias term representing, as required, any 
of the three terms in a) or b) or c) above; and for convenience we shall take Z 
always in the positive sense. 

The ratio R.B./V = W2Z/V is the “effect” of the relative bias as a ratio of 
V=C/<. It is also equal to the ratio of the bias to the standard error of the 
estimate. If R.B./V =0.1 the P =.05 level of confidence is distorted to .0511; 
we call this effect (or less) “negligible.” If R.B./V =0.2, the P =.05 level is dis- 
torted to .0546; this effect is “small.” If R.B./V =0.6, the P=.05 level is dis- 
torted to .0921; this effect (or more) is “large.” If 0.2<R.B./V <0.6, we say 
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the effect is “neither small nor large.”!* Table 524 shows the sizes of the general- 
ized bias term (Z) necessary to produce a “negligible,” (0.1), or a “small,” 
(0.2), or a “large,” (0.6), “effect” respectively, for different proportions of non- 


coverage (W2) and for different sample sizes (n). Specifically, columns 2, 3 and 
4 of Table 524 show the evaluation of 


a C (Bias effect) 
— Was/n 





Z 


for C=1 and for alternate bias effects. 

The coefficient of variation (C) per element was assumed to be 1.0. This is in 
the nature of a central value. Income and liquid assets are in that neighbor- 
hood; for proportions near .50 to (C= V/q/p=-+/.5/.5. For values of p near 
.20, C=+/.8/.2=2. For some estimates C' will be less.!® When C is known to 
have a different value, it can be used to multiply the entries in Table 524. 

Now for some rough conclusions. 


a) For estimating Y, the value of Z is usually near 1.0. In Table 524 we see 
that if W.=.10, “large” effects are produced for even the smallest samples. 
Even for W.=.03 “large” effects are produced for all samples for n>400. 
Because of this and because of the large sampling errors, estimates of Y 
by means of “simple expansions” of our samples—and of most household 
surveys—are not attempted. Our expansions use averages (VY) and are 
based on Current Population Survey estimates; these correct for non- 
coverage by tying in to the decennial Census with current vital statistics. 
Estimating values of Y are among the important objectives of our surveys. 
The sizes of our surveys are mostly near the 1600 to 2500 range—in 
“simple random equivalent size.” There is some evidence from the Census 
Post Enumeration Survey” that for income Z, it is about 0.2. In general, 
we would guess that for many items Z, would be between 0.05 and 0.25. 
On that basis, when our noncoverage was W,=0.1 most of our items had 
“effects” greater than “negligible,” and some of them “large.” Contrari- 
wise, with our present W: about .03, we are in the region where none of 
our results suffers from a “large effect,” and most are probably at or near 
negligible. For larger surveys (n= 10,000) even this small noncoverage 
(W:=.03) may result in “effects” above the “small” type. 
ec) For comparisons of subgroups of our surveys we are in the size classes of 
n= 100 and n=400. (A comparison of two subgroups, each with n =800, 
has a sampling error about as large as a sample of n = 400.) For this cate- 
gory we suspect that even the former large noncoverage (W2=.10) usually 
resulted in small or negligible “effects.” On the other hand, in larger 
samples these comparisons may also suffer effects which are not “small.” 


b 


—— 





18 For effect on confidence levels of the ratio Bias/Standard Error, see Table 1.1 in William G. Cochran, 
Sampling Technigq John Wiley and Sons, New York: (1953), p. 9. 

18 For a display of values of C for some characteristics see Hansen, Hurwitz and Madow, op. cit., pp 141-146. 

2 Monroe G. Sirken, E. Scott Maynes, and John A. Frechtling, “Reasons for differences in survey estimates of 
income: survey of consumer finances and post-enumeration survey,” in Report of the 1956 Conference on Research 
in Income and Wealth, National B of E ic Research, in press. 
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TABLE 524 


VALUES OF Z NECESSARY TO PRODUCE A “NEGLIGIBLE,” “SMALL,” 
OR “LARGE” “BIAS EFFECT’; FOR DIFFERENT NONCOVERAGE 
RATES AND FOR ALTERNATE SAMPLE SIZES* 























Noncoverage rate 
W: =0.10 W: =0.03 
Sample Size 
Bias effect Bias effect 
0.1 | 0.2 0.6 0.1 0.2 0.6 
n Negligible Small Large Negligible Small Large 
(1) (2) (3) (4) (2) (3) (4) 
100 -100 .200 .600 .333 .667 2.000 
400 .050 .100 .300 -167 .333 1.000 
1,600 .025 .050 .150 -083 .167 .500 
2,500 .020 .040 .120 .067 .133 .400 
10,000 .010 .020 -060 .033 .067 .200 























* The “Bias Effect” is the ratio of the noncoverage bias to the standard error; values of 0.1, 0.2 and 0.6 are 
denoted respectively by “negligible,” “small,” and “large.” 
C (Bias effect) 
Z is a generalized bias term in Z = ra ; where Ws is the proportion of noncoverage; and the 
avi 
sample size, n, is in “simple random equivalent.” Z may refer to Ze, Zp or Z- as explained in the text. It is assumed 
that C=1. 














COMPLETE COUNTERBALANCING OF IMMEDIATE 
SEQUENTIAL EFFECTS IN A LATIN 
SQUARE DESIGN 


James V. BRADLEY 
Aero Medical Laboratory, Wright Air Development Center 


If there is an even number of experimental conditions (Latin letters), 
it is possible to construct a Latin Square in which each condition is 
preceded by a different condition in every row (and in every column, if 
desired). These designs are useful in counterbalancing immediate se- 
quential, or other order, effects. A simple, and easily remembered, pro- 
cedure by which to construct such squares is described and illustrated. 
A proof is offered which shows that the procedure is valid for any size 
square having an even number of cells on a side. 


1. INTRODUCTION 


upposs that each of 2k individuals is to be exposed once to each of 2k ex- 
S perimental conditions. In some cases it may be suspected that reaction to 
any given experimental condition will be influenced (a) by the number of experi- 
mental conditions preceding it, i.e., by its ordinal position in the sequence of 
presentation, (b) by the particular experimental condition immediately preced- 
ing it. This situation is commonly encountered in psychological experiments 
when every subject is run under all experimental conditions. Such experimental 
designs reduce variance at the expense of introducing sequential effects such as 
learning, fatigue and interactions between reactions to various experimental 
conditions, particularly those adjacent in order of presentation. 

Under the conditions stated it is possible to counterbalance (a) and (b) above, 
i.e., for each of the 2k individuals a given experimental condition is preceded by 
a different number of conditions and by a different immediately preceding 
condition. The method has been discussed by Bugelski [1] and illustrated in the 
case where 2k=6. A design appropriate to the case where 2k=4 can be found 
in Cochran and Cox [2]. The general use of Latin Squares to eliminate un- 
wanted experimental effects has been discussed by Grant [3]. It is the purpose 
of the present note to supplement this earlier material by presenting a very 
simple and easily remembered method by which such counterbalanced designs 
may be obtained and to prove its validity for any integral value of k. 


2. METHOD 


(a) Construct a 2kX2k table in which rows represent individuals and col- 
umns, from left to right, represent ordinal positions, from first to last, in which 
experimental conditions may occur in the sequence of condition-presentation. 
(b) Number the 2k experimental conditions successively from 1 to n. (c) Assign 
successively the integers from 1 to n to the n cells in the first row by proceeding 
from left to right entering only cells in odd-numbered columns, then reversing 
direction and filling in cells in even-numbered columns. (d) In each column, 
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starting with the number already entered in the top cell, proceed downward 
entering in each cell the integer immediately following the one in the cell above 
it, except that the integer n is considered to be followed by the integer 1. 

' It will be found that any given integer appears once in each row, once in each 
column, and is preceded once (and is followed once) by each of the other 
integers and once by none of them. The following table illustrates the method 
for the case where 2k=8. 

It is to be noted that while immediate sequential effects are completely 
counterbalanced, remote effects are not. For example in the table shown, the 
integer 1 is thrice preceded by an 8 and thrice by a 2 in the second cell preceding 
the cell which it occupies. 


TABLE 526 
ILLUSTRATION OF METHOD FOR CASE WHERE 2k=8 




















Order of Presentation of Experimental Conditions (Cell Entries) 

wae to Individuals 
Individual 

Ist IInd IlIrd IVth Vth Vith VilIth VIIIth 
A 1 8 2 7 3 6 4 5 
B 2 1 3 8 4 7 5 6 
Cc 3 2 4 l 5 8 6 7 
D + 3 5 2 6 1 7 8 
E 5 4 6 3 7 2 8 l 
F 6 5 7 4 8 3 1 2 
G 7 6 8 5 1 4 2 3 
H 8 7 1 6 2 5 3 4 

3. PROOF 


The proof will consist merely of showing that the steps outlined under Pro- 
cedure are sufficient to insure that each integer (a) appears exactly once in 
each column, (bd) appears exactly once in each row, (c) is preceded exactly once 
by each of the other integers. Consider the integers from 1 to n arranged clock- 
wise around a circle in increasing order of magnitude. Call the number of 
integers passed or reached in proceeding clockwise from one integer to another 
the “separation” between the two integers. By construction each cell entry in 
each column has a separation of 1 from the entry in the cell above it. Since 
there are n such cells in each column, each of the n integers appears exactly once 
in each column. And since, by construction, the top cell in each of the n 
columns is occupied by a different one of the n integers, each row must contain 
each integer exactly once. Phrased differently, each cell entry in a given row 
has the same separation from the corresponding cell entry in row 1; and since 
each of the n integers appears exactly once in row 1, they must do likewise in 
all other rows. 

If there are an even number of columns, the cell entries in the first row will be 


1,n,2,n —1,3,n—2,4 Et erie Bi yy 
,n, 4,n — lon — Steet i ee ee PEAT 1: ’ 
P 2 2 2 2 
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in order from left to right. Proceeding in the same order, the separations be- 
tween entries in pairs of adjacent cells are respectively n—1, 2, n—3, 4, n—5, 
6, ---,3,n—2, 1. Thus, the separations, in the order given, form a decreasing 
progression of odd numbers interlaced with an increasing progression of even 
numbers, and each of the n—1 separations appears exactly once. 

By construction, each cell entry in a given row has a separation of exactly 1 
from the corresponding cell entry in the row above it. Therefore every row will 
have the same separations between entries in adjacent cells as exist for the cor- 
responding cells in row 1. That is to say the separation existing between a pair 
of adjacent cells in row 1 is an attribute of the corresponding pair of adjacent 
columns. 

Now consider any given integer, i, and let p represent whatever integer im- 
mediately precedes i in the same row when 7 appears in the 2k X2k table. Since 
each of the n—1 pairs of adjacent columns is associated with a different one of 
the n—1 possible separations, and since i appears exactly once in each column, 
p must assume each of the n—1 integral values other than 7 exactly once. The 
same reasoning demonstrates that each of the other integers must follow i 
exactly once. 


4. MODIFICATIONS 


As the table stands, each condition is immediately preceded once by each of 
the 2k—1 other conditions and once, when it appears in the Ist column, by no 
condition. Thus the set of immediately preceding conditions is not the same for 
every condition since a condition does not immediately precede itself. A remedy, 
suggested by the Referee, would be to add a zeroth column, identical to and 
just to the left of the Ist column. The data collected under the zeroth presenta- 
tion would be discarded, and the sole function of the column would be to 
complete the balance by substituting “preceded by itself” for “preceded by 
nothing.” Thus every condition would have exactly the same set of immedi- 
ately preceding conditions. In many fields an experimenter might be reluctant 
to take data which subsequently must be discarded. In psychological experi- 
ments, however, since initial performance is frequently characterized by atypi- 
cal scores and high variability, a “practice” period is often introduced previous 
to experimentation. The zeroth presentation therefore could serve the function 
of the practice period with no loss of research efficiency. 

It has been pointed out by the Referee that the design property shown to 
exist within rows, i.e., that a given integer is immediately preceded (and is 
immediately followed) once by every other integer, can be made two-dimen- 
sional so that it is true of both rows and columns. The proof already given is 
valid for any permutation of rows. Therefore in order to counterbalance for 
adjacency in both dimensions, one need only construct the table already out- 
lined, then permute rows so that reading downward in the first column one ob- 
tains the same sequence of cell entries as exist from left to right in the first row 
of the table already discussed. The table shown below represents this modifica- 
tion applied to the table presented earlier. 
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TABLE 528 


MODIFICATION OF TABLE 1} COUNTERBALANCING ORDER 
EFFECTS IN COLUMNS AS WELL AS ROWS 
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It should be noted that while rowwise and columnwise adjacency is com- 
pletely counterbalanced, adjacency in the diagonal dimension is not. 
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FACTORIAL EXPERIMENTATION IN SCHEFFE’S ANALYSIS 
OF VARIANCE FOR PAIRED COMPARISONS 


Orro DyxstTRA, JR. 
General Foods Research Center 


In applying paired comparisons methods we sometimes find that the 
samples to be compared represent factorial combinations. It is found 
that Scheffé’s method of paired comparisons may be applied for an 
underlying factorial design. The larger designs tend to become pro- 
hibitive, since to double the number of test samples requires about four 
times the number of judgments. A blocking precedure is described 
where only twice the number of judgments are required to double the 
number of samples, taking in account an underlying 2-series factorial. 
The larger designs provide many degrees of freedom for order effects 
but may not provide any for a true error estimate. We obtain degrees 
of freedom for error by running some pairs twice in only one order. 
We again may use the underlying factorial to decide which pairs to run 
once in each order and which pairs to run twice in only one order. 


1, INTRODUCTION 


AIRED comparisons techniques are finding wide application in industry, 
due in part to the recent literature on the subject. Although much of this 
literature deals with rank analysis, we shall concern ourselves with the analysis 
of variance of scored preferences as devised by Scheffé [4]. 
The purposes of this paper are four-fold: 


1. To review the experimental setup and the mathematics of the Scheffé 
procedure (Section 2). 

2. To discuss the application to factorials, with emphasis on the 2?~? frac- 
tional factorials (Section 3). 

3. To describe a blocking procedure which simultaneously fractionates the 
required numbers of observations (Section 4). 

4. To describe a method for obtaining an error term in large experiments 
where no error term could otherwise be obtained (Section 5). 


The procedures are illustrated by a worked example (Section 6). 

The reader is referred to Daniel [1] and Davies [2] for a discussion of frac- 
tional factorials. The paired comparisons methods were recently discussed by 
Jackson and Fleckenstein [3]. 


2. SUMMARY OF SCHEFFE’S PROCEDURE 


There are m treatments to be compared in all M =m(m—1)/2 possible pairs. 
We obtain 2r observations per pair, r in each order of presentation. The obser- 
vations are scored statements of preference between pair members and range 
on a 7-point scale between +3 and —3. The preference scores are positive if the 
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first member of a pair is preferred, negative if the second member is preferred, and 
0 if there is no preference. The points of the scale are described by Scheffé as: 








Score Statement on Pair (i, 7) 





I prefer 7 to j strongly 
I prefer i to 7 moderately 
I prefer i to j slightly 

0 No preference 
—1 I prefer j to z slightly 
—2 I prefer j to i moderately 
—3 I prefer j to i strongly 








Considerable difficulty with approximate normality of observed scores was 
encountered when Scheffé’s method was first applied, but by experimenting 
with our interviewing technique and the wording of the preference questions, 
we were able to find a technique which yielded nicely distributed results. It 
should be noted that 5-point scales may also be used. Again, it would be neces- 
sary to establish a proper testing technique before using these scales. 

We define z,;, as the kth observation on the ordered pair (7, 7) and assume all 
Zije are independent random variables with mean y,; and variance o’. 

The mean preference for treatment i to treatment 7 is u.; when presented in 
the order (7, 7) and —y;, in the order (j, 7). The average preference z,; and the 
average difference due to order of presentation 6,;; are given by: 


mis = 3(uis — mya) i= — ij, 


(1) 
55 = 3(usus + mya), 555 = b,j. 


The average order effect 5 is given by 
6 = > uis/(2M) 
i4j 


= b » 5;;/M 


<j 


(2) 


where M equals the number of pairs, M =m(m—1)/2. The first summation in 
(2), and in similar subsequent summations, is over all values of i and j with 
i,jg=1,---, mand ix¥j. 

Scheffé, in his “hypothesis of subtractivity,” postulates that there exist pa- 
rameters a, a2, - - - , & Characterizing the m treatments, such that the average 
preference x,; for 7 to 7 is the difference between the corresponding parameters 


Tig = A — Aj, tui = 0, (3) 


so the “deviations from subractivity” y;; are given by 


Vig = Tig — (as — a). (4) 


Since >", ry=ma:— 7%, aj, Scheffé adds the assumption 7%, a;=0 to 
permit estimation of the a;. Accordingly, >°7, xj=ma;. The a; may be re 
garded as main effects. 
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Since the 2; are random variables, e;; is used as the “error” in 2%, so that 
€sjk = Xijk — wij. The components which have been discussed are summarized be- 
low, together with the appropriate sums of squares resulting from partitioning 
the total sum of squares, using the analysis of variance table suggested by 
Scheffé. The degrees of freedom and sum of squares for order effects may be 
partitioned into 1 and M—1 degrees of freedom, as shown. As a result, we do 
not include in the totals the parenthesized entries M and S;. This table also 
shows the identities between the sums of squares. To complete the analysis of 
variance we compute the mean squares and the F-ratios, comparing each mean 
square to the error mean square. The last column has been included to show 
what the F-ratios are testing. 


ANALYSIS OF VARIANCE 





























s Degrees of Sum of He: 
i Freedom Squares = 
Main effects (a; —a;) m—l1 Sa ox = 0 
Deviations from subtractivity (-+:;) M—(m-1) S, vii =O 
Average preferences (7;;) M S, 
Order effects (8;;) (M) (Ss) 
Average order effect (8) 1 2rM 8? 6=0 
Diff. among order effects (5;; —é) M-1 Ss; —2rM & 5; -6=0 
Means (ji;) 2M S, 
Error (¢;;%) 2M (r—1) S, 
Total (2;;x) 2rM S; 








The following formulas will aid in estimating the various effects and comput- 
ing the sums of squares: 


Effect Estimation Sums of + 


ye s.= DD 20 


i¥7 





bij Thijs = Dy Lage Ss, =— -> (rpij)* 
kel tj 
ij Qrmig = Tus — Tuy S, = — % (2rmi;)* 
2r i<j 
<a ae “ 
bi; 2réi; = Thi “+ Tj Ss = — > (2r6;;)? 
lr i<j 
5 rMi= Dam — 4Mi = —( p> rw) 
inj 2rM ix] 


TR 
R 
ll 


. on 1 
a; 2rma; = , 2rrij -——-- = (2rma;)? 
jai 2rm | 





532 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1958 


The advantage in using these formulas rather than those given by Scheffé 
arises from using integers throughout all the computations. The remainder of 
the sums of squares needed for the analysis of variance table are obtained by 
subtraction. Thus, S,=S,—S, and S,=S,—S.,. 


3. APPLICATION TO FACTORIALS 


Up to this point we have not discussed the m treatments, which may have 
been m brands of a product or m factorial combinations, in which case we may 
consider the @; as responses. Let us consider p factors, A, B,---, P. The 
number of levels will be a, b,---, p, respectively, so that there are 
m=ab---, p total combinations. Thus we may find m—1 orthogonal con- 
trasts 6; such that 


6; = >> eid, > es = 0; j=l1,--:-:,m-—1; and 
tm t=1 
“ (5) 
Dd ejscrs = 0; RR=1+++s.m—1ligeFk. 
t=] 


The sum of squares for main effects S, is thus partitioned into m—1 compo- 
nents, each with 1 degree of freedom, with each contrast 6; contributing 


Ormé;? / Det = [ : enrma.) | / (rm) & es! (6) 


to S,. In private correspondence Scheffé has shown 


oc? 
Var (6;) = C;<*. (7) 
2rm ; 

In the case of 2? factorials, the above method is changed slightly if fractiona- 
tion is necessary (we would usually run all 2? combinations only if p is small, 
i.e. not larger than 4). For fractional factorials the number of combinations is 
m=2"-%, where 2~¢ is the degree of fractionation. In determining the contrasts, 
cj:= +1, so that the sum of squares attributed to each contrast is 


2 
2r6;? = | X eras) | f m(2rm). (8) 


In practice, we would use the 2rm@; as responses and use Yates’ procedure to 
obtain the m—1 single degree of freedom contrasts. The variance of a factor 
effect 6;/m is given by 
Var (6;/m) = o?/2rm?. (9) 
This demonstrates the advantage in using a factorial design, since 
Var (@;) = (m — 1)o?/2rm = (m — 1) Var (6;/m). (10) 


The need for modification of the Scheffé procedure is demonstrated in Table 
10D. As m increases we need about four times as many observations to double 
the degrees of freedom for main effects. The blocking procedure described 
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below allows us to double the degrees of freedom for main effects by doubling 
the number of observations. Then the following section discusses a better allo- 
cation of the degrees of freedom for order effects and for error. 


TABLE 533 
ALLOCATION OF DEGREES OF FREEDOM (r=1) 








Degrees of Freedom 





Source 
m =32 m = 64 





Main effects 31 63 m—1 
Dev. subtractivity 465 1953 (m —1)(m—2)/2 





Avg. preference 496 2016 m(m—1)/2 
Order effects 496 2016 m(m—1)/2 





Means 992 4032 m(m—1) 
Error 0 0 0 








Total 992 4032 m(m—1) 





4. BLOCKING 


Attention is now restricted to 2”~¢ fractional factorials. We have 2? possible 
factorial combinations. Assuming that 2” is too large to run, we take a 2-¢ 
fraction of the possible combinations, so that our design consists of 2?~¢ 
factorial combinations. We shall run these combinations in two blocks with 
m=2°-*! treatments per block. Each block is run as a separate design.'! Thus 
all possible pairs of treatment combinations in each block are run, but the 
treatment combinations in one block are not compared with any of the treat- 
ment combinations in the other block. The following schematic diagram repre- 
sents what is happening where p—q=4. The appropriate analysis procedure 
is to analyze each block separately and then combine the analysis of vari- 
ance tables, adding the respective sums of squares and doubling the degrees 
of freedom. 


,8.s@ 9---16 





Block 1, m=8 | Pairs not run 





Pairs not run | Block2,m=8 








16 











1 Using 2 blocks is similar to taking a 2~¢— fraction in each block. 
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Since r= 1 (we would prefer to run a bigger design rather than increase r), 
the variance of a factor averaged over both blocks is 


V[(6;/ + 6;/’)/2m] = o7/4m?, (11) 


where 6’; and 6’’; are the jth contrasts in the first and second blocks, respec- 
tively. As expected, this is twice what the variance would have been if the de- 
sign had had 2m treatments, even though we have only one-half the number of 
judgments required for the usual 2m design. 


5. ALLOCATION OF DEGREES OF FREEDOM FOR ORDER EFFECTS AND ERROR 


When r=1 we have no degrees of freedom for error estimation. Scheffé con- 
siders this situation and suggests using the order effects mean square for the 
error term. An alternative suggested here is to run some pairs as indicated, once 
in each order, and to run some pairs twice but in only one order. We allow 
degrees of freedom for order effects in the former case and for error in the 
latter. 

The procedure to be described is applied to the 2” full factorial, to facilitate 
the description (there is no difficulty in applying the procedure to the 27-¢ 
fractional factorials, either unblocked or blocked as in Section 4). The m treat- 
ment combinations are divided into two groups, as in the blocking. For example, 
for five factors we may use ABCDE to group the treatment combinations. The 
two groups are shown in Table 534. Each treatment combination will be com- 
pared once in each order with each of the other treatment combinations within 
the same group. The treatment combinations in one group will be compared 
twice in only one order with each of the treatment combinations in the other 
group. Thus m(m—2)/4 pairs are run in each order and m?/4 pairs in only one 
order. 


TABLE 534 
GROUPING FOR FIVE FACTORS, USING ABCDE 








Group 1 Treatment Combinations Group 2 Treatment Combinations 





(1) ad ae de a d e ade 
ab bd be abde b abd abe bde 
ac ed ce acde ec acd ace ede 
be abed abce bede abc bed bee abcde 








In the event there is a bias for either the first or second sample presented, 
we must balance the direction of running the pairs for each treatment com- 
bination. An example of a balanced design for three factors is given in Table 
535. The treatment combinations were grouped using ABC (it will generally 
be easiest to use the highest letter combination for grouping). Thus, those 
combinations which have 0 or 2 of the letters a, b, and c fall into the first group, 
while those which have 1 or 3 of the letters fall into the second group. The de- 
cision whether to run 0 or 2 judgments in the intergroup comparisons was based 
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TABLE 535 


ALLOCATION OF JUDGMENTS (THE NUMBERS IN THE CELLS 
INDICATE THE NUMBERS OF JUDGMENTS TO BE 
OBTAINED IN THE INDICATED DIRECTIONS) 








Treatment Combination Presented Second 





Treatment 
Combination Group 1 Group 2 
Presented First 





ao 
a 


b 


ab ac 


~~ 
— 
~~ 





(1) 
Group 1 ab 
ac 
be 


— 


1 
x 
1 
1 


Meee 








Group 2 


nN oOo © 























Xx 
1 
1 
1 


|oonw 
to 


abe 





on AB, which may be used regardless of the number of factors. Where the 
treatment combinations in the first group are presented first in comparison 
with the treatment combinations in the second group, the comparisons of the 
combinations agreeing with respect to AB are run twice, while those differing 
are not run. (By “agreeing” we mean that both combinations have either 0 or 2 
of the letters a and b, or that both have only one of the letters.) The situation 
is reversed when those in the second group are presented first against those in 
the first group, since the comparisons of the combinations differing with respect 
to AB are run twice. Thus, apart from the order of presentation, two judgments 
are obtained on each pairing. The degrees of freedom for experiments of various 
sizes, run in two blocks with the judgments arranged in each block as described 
above, are given in Table 536. 


6. A WORKED EXAMPLE 


The example chosen to illustrate the factorial, blocking, and balancing pro- 
cedures was part of a series of experiments designed to improve an existing 
product. There were four factors in the experiment as described below: 








Factor Levels Letters 





Type of Raw Material A, B, C 
Acid Level D, E 
Acid Type ‘ F 
Additive Level G 

















2In general, we should like to be clear of main effects in making this decision. Thus, we may use a letter combi- 
nation with more letters, if the combination is not confounded with a main effect in a fractional design. It does 
seem, however, that the two letters are far enough removed from main effects for this purpose. 
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TABLE 536 


ALLOCATION OF DEGREES OF FREEDOM, USING TWO BLOCKS 
OF m EACH AND re-l1 








Source 


Degrees of Freedom 





2m =32 


2m = 64 2m 





Main Effects 
Dev. Subtr. 


30 
210 


62 
930 


2(m—1) 
(m —1)(m—2) 





Avg. Preferences 
Order Effects 
Error 


240 
112 
128 


992 
480 
512 


m(m—1) 
m(m —2) /2 
m?/2 





Total 





480 


1984 


2m(m —1) 





Admittedly any fraction of the 128 possible combinations would not allow the 
clear estimation of two-factor interactions. Nevertheless, only one-half of these 
combinations were run, with I= —-ABCDEFG as the defining contrast. The 
blocking generator was ABDF—CEG, giving 32 combinations in a block. 
ABCDE, representing the block live letters, was used to group the pairs, de- 
ciding which pairs to run once in each order and which to run twice in only one 
order. AB was used to decide in which of the two orders to run those pairs to be 


run in only one order. 


TABLE 536A 
EXPERIMENTAL TREATMENTS COMBINATIONS 








Block 1 


Block 2 





Group 2 


Group 1 
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bf 
cg 
abcg 
df 
abdf 
acdg 
bedg 
eg 
abeg 
acef 
beef 
adeg 
bdeg 
edef 
abcdef 





fg 
abfg 
ac 

be 
adfg 
bdfg 
ed 
abed 
ae 

be 
cefg 
abcefg 
de 
abde 
acdefg 
bedefg 


COnNOarrkwn 
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TABLE 537 
EXPERIMENTAL RESULTS FOR BLOCK 1 
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TABLE 540 
EXPERIMENTAL RESULTS FOR BLOCK 2 
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TABLE 541 
ANALYSIS OF VARIANCE (2m=64, r=1, 2 BLOCKS) 
Source d.f, 8.5. M.S. F 
“Main Effects” (62) (427.47) 
ABC (Type of Raw Material) 7 22.48 3.21 .76 
DE (Acid level) 3 93 .80 31.27 7.43 
F (Acid Type) 1 25.95 25.95 6.16 
G (Additive Level) 1 14.30 14.30 3.40 
DEF 3 81.04 27.01 6.42 
Others 47 189.90 4.04 .96 
Dev. Subtr. 930 3720.53 4.00 .95 
Avg. Preferences 992 4148.00 
Order Effects (480) (2159 .00) 
Avg. Order Effect 1 6.55 6.55 1.56 
Resid. Order Effect 479 2152.45 4.49 1.07 
Error 512 (2147.00) 4.21 
Total 1984 8464.00 








The treatment combinations are given in Table 536a, the experimental 
results in Tables 537-540, and the analysis of variance in Table 541. Significant 
effects were attributed to acid level, acid type, and the interaction of acid level 
and acid type. The averages for the eight combinations of acid type and level 
are given in Table 541A. It can be seen from the averages that preference reaches 
a maximum at the second level for Acid Type 1 and at the third level for Acid 
Type 2. With both types at their best levels, Acid Type 2 is more preferred 


than Acid Type 1. 


TABLE 541A 


AVERAGES FOR THE COMBINATIONS OF ACID TYPE AND LEVEL 














Acid Level 
Acid Type — 
1 2 3 4 
1 — .053 .211 — .027 — .449 
2 —.141 .066 .338 .055 








The mean squares for the other factors are not large enough to be significant. 
The mean squares for the remainder of the interactions, for deviations from 
subtractivity, and for order effects agree well with the error. 


7. REMAINING PROBLEMS 


a. Blocking was done in two blocks. It may be necessary to carry out larger 
experiments in four blocks, which certainly would be advisable because 
of the waste of degrees of freedom on subtractivity, order effects, and er- 


ror. 
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b. Balanced sub-sets. Since we have very high precision in the larger experi- 
ments, it may be desirable to omit some pairs in a block. The main con- 
sequence of this would be to reduce the degrees of freedom for subtrac- 
tivity. 

REFERENCES 


{1] Daniel, C., “Fractional replication in industrial research.” Proceedings of the Third 
Berkeley Symposium on Mathematical Statistics and Probability, 1954. 

{2} Davies, O. L., The Design and Analysis of Industrial Experiments. London: Oliver and 
Boyd, 1954. 

[3] Jackson, J. E., and Fleckenstein, M., “An evaluation of some statistical techniques 
used in the analysis of paired comparison data,” Biometrics, 13 (1957), 51-64. 

[4] Scheffé, Henry, “An analysis of variance for paired comparisons,” Journal of the 
American Statistical Association, 47 (1952), 381-400. 

















DESIGN AND OPERATION OF A DOUBLE-LIMIT 
VARIABLES SAMPLING PLAN 


AcueEson J. DUNCAN 
Johns Hopkins University 


The paper presents and illustrates the use of three charts and a table 
for the design and operation of a double-limit variables sampling plan. 
Two of the charts are plots of the Lieberman-Resnikoff tables of the 
minimum variance unbiased estimate of the proportion of a normal 
universe exceeding specified limits given sample estimates of the mean 
and standard deviation or average range. The table is an extension of 
the Patnaik table of conversicn factors for use of the average range in 
lieu of the standard deviation. 


1. INTRODUCTION 


| yee: and Resnikoff [2] have published a set of variables sampling 
plans that were based largely on Blackwell and Girshick’s minimum-vari- 
ance unbiased estimator of the fraction defective of a normal universe. The 
article contained a standard set of 16 plans and their operating characteristic 
curves, together with tables for use in operation of the plans. These plans are 
for the most part identical with those of the new variables standard of the 
Department of Defense [6], Mil. St. 414. 

Whereas many will be willing to use one of the plans offered by Lieberman 
and Resnikoff [2], there may be others who would prefer to design a variables 
plan to meet their own special needs. For single limit plans, standard procedures 
have been known for some time for deriving a variables plan to pass through 
two points on an operating characteristic curve [7]. The special feature of 
interest in the work of Lieberman and Resnikoff is that they offer statistically 
efficient plans for situations in which there are double specification limits. It is 
the purpose of this article to present a set of charts and a table that will be 
helpful in deriving and operating a double-limit variables sampling plan of the 
Lieberman-Resnikoff type. 


2. DESIGN AND OPERATION OF A DOUBLE-LIMIT SAMPLING PLAN BASED 
ON THE MEAN AND STANDARD DEVIATION OF THE SAMPLE 


For the sake of brevity we shall call the Lieberman and Resnikoff type of 
plan a p* plan. To design a p* double-limit plan that will have a given i, pas, 
a and 8, where 1—a is the probability of accepting lots of p; quality and @ is 
the probability of accepting lots of pz quality, we take the following steps: 

(a) We design a single-limit plan for these same p,, po, a and @. For this the 
formulas offered by Wallis [7] may be used, viz.,' 





1 For a more precise method see [5]. 
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K.K2 + KsKi 








k= 
1 Dado (1) 
2(Ka + Kg)? + (KaKe2 + KsK,)? 
n= 
2(Ki — K:)? 
where K, is defined by 
2 en /2 
— dz = 2 
4. V2n ‘ ™ 
and K,=K,, and K:=K,g,,. 
(b) For the k and n so derived, we find 8,* from the formula 
1— v k 
n—1 
hee ea (3) 


given by Lieberman and Resnikoff [2]. We then enter Fig. 545 with an ab- 
scissa value equal to 8,* and for the computed value of n read or interpolate 
the value of p*. Fig. 545 is merely a graph of the incomplete Beta function 
for equal parameters and was derived from Karl Pearson’s Tables of the Incom- 

plete Beta Function, taking p*=B(p, q), 8,*=z and n=2p+2 with p=q. 
The two steps stated above will yield a double-limit p* plan for specified 
Pi, P2, a and 8. This plan will be operated as follows. Take a random sample 
of size m and compute X= )>X/n and 
DX (xX — X)? 

s = — — 








n—I1 
Compute 
i U = 


xX 
C.= and Cy = , 
8 s 








where LZ and U are the lower and upper specification limits respectively, and 
from Fig. 546 determine f; and fv. If $1+/u <p*, accept the lot; other-vise re- 
ject it. Fig. 546 is merely a graphical representation of Lieberman and Resni- 
koff’s Table IV. In most cases it can be read with sufficient accuracy for practi- 
cal operation of a plan. If greater accuracy is desired, the table should be used.? 

To illustrate the above procedure let us design a double-limit variables plan 
for which p,=0.014 and p2.=0.09, a=0.05 and 8=0.10. We shall have from 
(1) and tables of the normal probability distribution 


_ (1.6449) (1.3408) + (1.2816) (2.1973) 
: (1.6449 + 1.2816) 


2 The graph has some theoretical interest, in that it affords a ready comparison between the minimum variance 
unbiased estimate of p and the estimate frequently made by treating Cy, =(X —L)/s, for example, as a normal 
deviate. It indicates that these estimates are approximately the same for n >100, but differ markedly for small n. 
For low Cz, the minimum variance method yields a slightly higher estimate, but for larger Cz a much smaller 
estimate of p. The same statements, of course, pertain to Cy. 





k 1.716 

















DOUBLE-LIMIT VARIABLES SAMPLING PLAN 545 


and 


2(1.6449 + 1.2816)? + [(1.6449) (1.3408) + (1.2816) (2.1973) |? 
n= 





2(2.1973 — 1.3408)? 





05 «410 IS 20 25 380 35 40 .45 «50 Pps 


Fie. 545. Chart for determining p* from f,*. (Graph of 
incomplete beta function, equal parameters.) 


For average range plans: 


1 —k/Vv 


B,* = 2 


For standard deviation plans: 
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With k=1.716 and n=29 we have 
/29 
1 — —— (1.716) 
28 


B,° Ss 2 = 0.335, 





and Fig. 545 yields a p* of about 0.040. The desired plan is to take a sample of 
29, compute 


X-L U - 
Cr = and Cy = 








8 


and estimate p, and fy from Fig. 546. If the sum $,+/y <0.040, accept the 
lot. For example, suppose L=500 and U=700, and suppose the mean of a 
sample of 29 is 570 and 


paCaee 


n-—1l 


: = 


is 40. We then have 


570 — 500 700 —570 
Cy = ————— 2 1.795 ahd (Cy @ ———— = 
40 40 


3.25. 


0.0! 
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10 1.4 
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80 100 


n 


Fig. 546. Chart for estimating p from the quality index C and sample size n. Standard 
deviation plans. (Chart of Lieberman and Resnikoff [2], Table IV.) 








DOUBLE-LIMIT VARIABLES SAMPLING PLAN 547 


Fig. 546 yields £6, +0.037 and y=0.0001. The sum ath 0.0371. Since 
this is less than 0.040, we accept the lot. 


3. DESIGN AND OPERATION OF A DOUBLE-LIMIT SAMPLING PLAN BASED 
ON THE MEAN AND AVERAGE RANGE OF THE SAMPLE 


lor those who prefer to use the average range R instead of s, a similar pro- 
cedure can be employed. Patnaik has shown [3] that 


v(R/d2*)* 


o 


has approximately a x? distribution with degrees of freedom » given by Table 
1AD, where R is an average of the ranges of g sub-groups consisting of m items 
each and d,* is a constant given by the table. When R is to be used, therefore, 
we take the following steps: 
(a) Same as Step (a) for a plan using s. (See Section 2 above.) 

ib) To get approximately the same operating characteristic curve as ob- 
tained by using s, we look in Table 548 for the value of » that is close to n—1, 
where n is the total sample size derived in Step (a). This will yield a new total 
sample size nz =gm where g and m are given by Table 548 for the derived 
value of v. If there is a choice, we select the combination with highest » for 
equal gm. A study by Grubbs and Weaver [1] suggests that when gy is di- 
visible by 8, m=8 will be the optimum sub-group size. Sub-groups of 7 or 9 are 
almost as good, however. This optimum character of sub-group of 8 can be 
seen from Table 548. 

(c) Take 


1 — k/Wv 
2 


and determine’ p* from Fig. 545 with n of the figure equal to »+1. 
This range plan is operated as follows. Take a random sample of gm=npz 
items, subdivide it into g groups of m each. Compute 


c X-L ac U-x 
= =— an = — 
‘ R/d,* d R/d2* 

and from Fig. 549 find £, and fu. Accept the lot if 6.+fuv<p*; otherwise re- 
ject. Fig. 549 is a graphical representation of Lieberman and Resnikoff’s 
Table VI. 

To illustrate let us return to the example worked out in the previous section 
and derive an average range plan for the same values of , pz, a and 8. These 
were p; =0.014, p2=0.09, a=0.05 and 8=0.10. In this case, n—1=29—1=28. 
In Table 548 the gm combination that yields a » closest to 28 is g=3, m=13. 
The next step is to find p*. We have 

1 —1.716//28 


8," = ~ = 0.338. 


* See [2]. This involves an additional approximation in that p > (¥, B is assumed to have approximately the 
same distribution as p (X,41,5,). 


B,° = 
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DOUBLE-LIMIT VARIABLES SAMPLING PLAN 549 


Fig. 545 then yields for 8,*=0.338 and n=29, a p* =0.042. 

Our plan will operate as follows. Take a random sample of 39 items and sub- 
divide it into 3 sub-groups of 13. For each sub-group find the range R and 
compute R. Compute 

X-L X-—500 U-—-X 70-—X 


CL == = and Cy == = — 
“"R/a*  =—-B/3.37 ” R/d* B/3.37 








From Fig. 549 estimate px and fv. If 1+/0 <0.042, accept the lot. 


4. OPERATING CHARACTERISTIC CURVES FOR DOUBLE-LIMIT VARIABLES PLANS 


The operating characteristic curve for a double-limit plan using s (or R 
adjusted for sample size) will be approximately that for the single-limit plan 
using s. (See [2].) Wallis’s formulas for this [7] are: 
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Fic. 549. Chart for estimating p from quality index C and sample size n. Average 
range plans. (Lieberman and Resnikoff [2], Table VI.) 





« More precise results would be given by tables of the non-central ¢ distribution. See [5]. 
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and K, is given by 

© g-2"/2 

= vie 

For example, if p=0.05, K,=1.645 and under the variables plan derived in 


Section 2, the probability of accepting lots of 0.05 quality would be the prob- 
ability that a standard normal deviate should exceed 


bewK, 1.716 — 1.645 0.071 
_ = —— = 0.241 


i E 1 (1716)? 0.295 
(/ tag “sls (/ Sy SE 
= Ste=3) 29° 2(28) 


which is given by the normal tables as 0.40. 





dz = p. 














5. A CAUTION 


The reader should be careful to note that these variables plans are all based 
on the assumption of normally distributed lots. We may state somewhat arbi- 
trarily that as long as p; > 0.01, little error is likely to result if iots are moderately 
skewed or leptokurtic. For a p,; much less than 0.01, however, serious doubts 
arise as to the validity of the operating characteristic curves derived on the 
assumption of normality. Care must also be exercised in rejecting lots under a 
variables plan that have been screened by the supplier and are thus truncated 
at one or both ends. Under such circumstances lots are usually submitted to a 
supplementary attributes plan before final rejection is allowed [6]. 
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EMPIRIC INVESTIGATION OF A TEST OF HOMOGENEITY FOR 
POPULATIONS COMPOSED OF NORMAL DISTRIBUTIONS 


G. A. BAKER 
University of California, Davis 
A test of homogeneity is developed that will frequently detect non- 
homogeneity by means of samples as small as fifty for some populations 


that are composed of normal components. The development is essen- 
tially the same as Smirnov’s distribution free tests as discussed in [4]. 


1, INTRODUCTION 


NE of the most universal causes of nonnormality of populations is non 

homogeneity. This fact was pointed out by Karl Pearson [8] in 1894 who 
considered dissecting nonnormal populations into normal components. Random 
sampling from such populations was discussed by Baker [1] in 1930 with further 
contributions later. A test of homogeneity for normal populations was given by 
Baker [2] in 1941 which was based on ideas developed by Neyman [7] in 1937. 
The problem is still recognized and recurrent as indicated by Preston [9] and 
Rao [10]. For Rao it appears as a problem of classification after nonhomo- 
geneity is demonstrated or assumed. 

Over a period of time ideas of testing hypotheses concerning cumulative 
frequency distributions have been developed by many authors. These develop- 
ments are partially indicated by Bartlett [3], Gumbel [5], Birnbaum [4], and 
Kae, Kiefer, and Wolfowitz [6]. 

These ideas are combined with empirical samplings from six hypothetical 
populations to develop a method for testing for nonhomogeneity and to give 
some idea of its effectiveness under rather widely varying population situations. 


2. SAMPLED POPULATIONS 
The populations from which samples were drawn are: 
(2 — 15.5)? 
(N) ——— exp. — 1/2 —————— 
529 25 
al (x — 15.5)? (x — 32.5) 
(1) exp. — 1 er a 
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The method of drawing samples is the same as that originally described in 
[1]. Each of the populations has a total area of 1296. Thus, 1296 integers dis- 
tributed over a proper range and with the frequencies indicated by the cor- 
responding areas were entered on charts with 6 big rows and 6 big columns of 
squares which were subdivided into 6 little rows and 6 little columns. In each 
case the 1296 integers were distributed in a nonsystematic way among the 1296 
individual spaces. By throwing 4 differentiated dice it was possible to draw 
random individuals from populations that are approximately N, I, II, III, IV, 
and V. The sample sizes considered are 50 and 100. 

The samplings were completely independent for each population and the 
numbers of samples for each were not necessarily the same. The dice were 
ordinary dime store dice ard may have shown 6’s oftener than they should. 
Since the items were scattered and not entered in blocks, little bias seems to 
have appeared in the final results of the distributions of means and variances 
examined elsewhere. 


3. TEST OF HOMOGENEITY 


For a finite sample n we first order the observed values x; from the smallest to 
the largest. Next, we compute the mean and standard deviation of the sample 
and express the observed values as deviations from the mean in terms of the 
standard deviation. Then, we use the tables of the normal areas te compute 
zx, the area from — © to the standardized observed values. We then consider 


Di je | (3.1) 


as a test of homogeneity. 
If the sample is infinite we have 


Di = f ff soaas — ff otoatas| ay 


where 
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g(x) = 
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The values of D, D', and ./D? for the infinite case can be approximated by 
using tables for the normal distribution and repeated numerical and/or 
graphical integration. 


4. THEORETICAL AND EMPIRICAL SAMPLING RESULTS 


First attention is called to the correlation between D, D', and D? for samples 
of 50 as given in Table 553. It is noted that the correlation between D! and 
\/D? is very high, averages 0.985 for the six populations. For this reason further 
discussions are limited to D and D'. It is seen that D is a maximum ordinate 
and that D' is essentially an average of the absolute ordinates. 

In Table 554a we compare the mean maximum ordinates D and the mean 
average absolute ordinates D' for samples of 50 and 100 and for (3.2) for the 
six populations. 

It seems that it might be of interest to present the frequency distributions 
of D and D' in some detail and this is done in Tables 554b and 555. This presen- 
tation permits an estimate of the power of these tests to be made by inspection. 
To facilitate the comparisons of the two measures of nonhomogeneity for the 
various populations we give the per cent of the frequencies beyond 0.13 for D 
and beyond 2.1 for 50 D'. We use 50 D' simply to avoid the bother of division 
of so many items by 50. 

In Table 556 we give results for samples of 100 from populations N and I 
for D and 50 D'. 


5. DISCUSSION 


We see that, even for samples of 50, for all populations D' and 4/D? are al- 
most perfectly correlated. Thus we have considered only D and D*' in detail. 


TABLE 553 


CORRELATIONS BETWEEN D, D', ./D? FOR SIX POPULATIONS 
FOR SAMPLES OF 50 








Population No. of Samples Tpp’ 





N 100 883 
I .826 
II -852 
III .896 
IV .840 
cs 731 
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TABLE 554a 


MEAN MAXIMUM ORDINATES, D, AND MEAN AVERAGE ABSOLUTE 
ORDINATES, D', COMPARED FOR SIx POPULATIONS FOR 
SAMPLES OF 50, 100, AND « 











| Samples of 50 | Samples of 100 | Values of (3.2) 
No. . ca et SEE Se eats 


lation D 


.032 | 0.086 
-051 | 0.110 
.037 
.050 
036 | 


.062 | 


TABLE 554b 


FREQUENCY DISTRIBUTIONS OF D FOR SAMPLES OF 50 FOR 
SIX PARENT POPULATIONS 
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Frequency distributions can be characterized, at least crudely, by their values 
of 6; and 62. For this reason 8, and 6. are given for the parent populations in 
Table 554a. 

The mean values for samples of 50 and 100 and the approximate theoretical 
values for (3.2) given in Table 554a show that the populations vary widely 
in the rapidity with which maximum discrimination is attained. For instance 
for population I the mean value of D' for samples of 50 is 71 per cent of the 


TABLE 555 


FREQUENCY DISTRIBUTIONS OF 50 D® FOR SAMPLES OF 50 FOR 
SIX PARENT POPULATIONS 
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value for (3.2) while for population V the mean value of D' for samples of 50 
is only 9 per cent of the value for (3.2). 

Table 555 indicates that D' is a better discriminator of nonhomogeneity 
for samples of 50 for all populations considered. However, the improvement is 
very slight for population IV and we cannot conclude that D" is better for all 
populations and all sample sizes. Population IV is somewhat more unusual than 
the other types so we might conclude that D' is to be generally preferred. 

It is clearly indicated that, by using the techniques developed here, non- 
homogeneity of sampled populations can often be detected by using samples 
as small as fifty. 

A referee points out the additional desirable discussion. The power of the 
test to detect nonhomogeneity depends on the alternative distribution and on 
the sample size. The alternatives are here characterized by #: and f2. If we plot 


TABLE 556 


FREQUENCY DISTRIBUTIONS OF D AND 50D! FOR SAMPLES OF 100 
FOR A NORMAL POPULATION (N) AND POPULATION I 
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the power of D and D' for tables 554b and 555 we have 
3 N(11,11) IV(25, 27) I1(14, 33) III(58, 69) 
2 1(47,75)  V(86, 95) 
Be 


By 


where the figures in the parentheses are the per cents of samples which are 
>0.13 for D and >2.1 for 50 D'. It appears that except for an irregularity for 
D for populations IV and II the power increases as 6; increases and as {2 
decreases from 3. 

The similar plot for samples of 100 for populations N and I is as follows: 


3 N(10, 10) 
2 1(49, 95) 
1 

0 

Bi 


where the figures in parentheses are the per cents of samples with D>0.11 and 
50 D'> 1.6. As is to be expected samples of 100 are much more effective in de- 
tecting nonhomogeneity than are samples of 50. This is especially true for D! 
when the alternative is a population I. 

We have not as yet been able to carry this investigation further. 
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programs to analyze current economic statistics, and he promptly put them to use in 
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A Current Appraisal of Concentration Statistics. M. A. ADELMAN, Massachusetis Institute of Technology. 


The new statistics compiled by the Census and in part released by the Senate Antitrust Subcommittee are a 
valuable addition to our stock of basic information. The statistics are sound in concept. The impression that they 
are in any sense inaccurate or invalid is incorrect. They do not measure the degree of monopoly, or even of fewness 
in an economic market; they form a body of evidence, greatly varying in precision, which does bear on the degree 
of fewness. 

Concentration did not increase from 1947 to 1954. It does not necessarily follow that it will not increase in the 
future. The new statistics pose sharply the need for summary measures of concentration. To use measures of in- 
equality would be a tempting but not a good expedient because inequality is not related to fewness and because 
the number of firms is unstable, and its meaning uncertain. Industry and product concentration statistics are 
complementary, not competitive. The chief problems in using them are of horizontal and vertical integration. 
Horizontal integration concerns substitution on the supply or demand side, and is not amenable to general statistical 
analysis. The “depth” aspect of vertical integration is subject to analysis and the chief tool available is some form 
of the ratio of materials purchases to sales. There appears to be wide intra-industry variation in this ratio. The 
identification of el ts r ible for greater or lesser vertical integration, and of product flow, poses many 
problems, all expected to be illuminated by release of Census data now in process. 


Rate of Change Approaches to Forecasting—Diffusion Indexes and First Differences. SiIpney S. ALEXANDER, 
M. h “ee Tr obePecd, of T. h. J oY 


A test of the predictive power of a current diffusion index based on fifteen to twenty-six of the components 
of the Federal Reserve Board Index of Industrial Production, as compared with the first differences of the FRB 
Index itself, leads to the following conclusions: 


1. The smoothing operation is necessary but it may be performed currently with a loss in lead time that is 
small compared to the length of lead time involved. A currently smoothed diffusion index of the FRB 
Index P ts led busi cycle peaks, in the interwar and post-war period, by 6 to 12 months. 

2. The differencing operation is also fundamental but it may be performed currently on an aggregate, such 

as the FRB Index, about as effectively as on the components. 

. The transformation of the quantitative differences to a two-valued system of attributes: rising and falling, 
is not essential to the lead over turning points. The first difference of the FRB Index, which preserves 
quantitative relations, leads turning points just about as well as does the corresponding current diffusion 
index. A diffusion index based on a large number of series may be somewhat smoother than the first difference 
of the corresponding aggregate, however. 
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4. Nor is the weighting operation fundamental, since the equally weighted diffusion index and the unequally 
weighted first difference of the FRB Index behave similarly with respect to their leads over business cycle 
turning points. 

5. While the use of hindsight is not fund tal to the lead over business cycle turns, hindsight is of funda- 
mental importance in avoiding false forecasts, or more accurately, in distinguishing forecasts of minor 
fluctuations from those of fluctuations large enough to be called business cycles. 


In short, a currently smoothed first difference of the FRB Index has about the same predictive value as a 
currently smoothed diffusion index based on the components of the FRB Index. Both lead all recognized business 
cycle peaks in the FRB Index (1919-1937, and 1947-1956) by 6 to 12 months, but both also reflect numerous 
minor intermediate fluctuations with a lead close to zero. The signals for a minor fluctuation do not seem to be 
contemporaneously distinguishable from those for a “real” business cycle turning point. 





Some Aspects of Factorial Experimentation. R. L. ANpeRsoN, North Carolina State College. 


Factorial experimentation was considered as to number of levels for each factor; nature of the levels (discrete 
or continuous) and corresponding experimenta! models; blocking arrang ts (confounding); methods of estimat- 
ing error variances; and methods of reducing the ber of factor combinations used in an experiment (such as 
fractionalization). The Galois Field method of founding and fractionalizing was discussed for 2 and 3-level 
experiments. For singly and fractionally replicated experi ts, it was proposed that research was p meatied on the 
possibility of developing a sequential procedure to estimate the error variance. 

Special consideration was given to the design and analysis of confounded factorial experiments, using ortho- 
gona! polynomial contrasts. The loss of information on first and second degree contrasts was derived|for a number of 
confounded multi-factor experiments, using the usual confounded factorial arrangements (such as those based on 
Galois Fields). A new method of constructing such confounded designs based on the signs of elements in the orthog- 
onal polynomial contrasts has been developed by Van der Reyden. 

The recently developed response surface designs were discussed, with certain cautions regarding their general 
use. Finally split-plot experiments were mentioned, with an indication of the effect of interaction on the estimates 
and tests of hypotheses. 














Measuring Labour Stability and Its Application to Employee Recruitment. J. E. ANprew, United Steel Co. Ltd. 
England. 


The increasing concern in industrial labour wastage is stressed, and an outline given of the principal object® 
of a labour turnover investigation. The inadequacy of the pted measure of employee stability—the labou™ 
turnover rate—is illustrated, and alternative measures proposed involving the derivation of a survival function’ 
Two methods of estimating this function are given yielding survival or stability curves. The approximation of 
these to a logarithmic normal form has facilitated the estimation of an expectation of service which is offered as a 
single measure of labour stability. Examples of the application of both methods to data collected in The United 
Steel Companies Ltd. are given. Research into factors underlying differences in stability between branches and be- 
tween smaller groups of men is illustrated, and suggestions are made as to how this knowledge may be used in 
assisting labour managers towards labour budgeting and personnel selection. The age and living circumstances of 
the entrant in particular are shown to have a powerful effect upon the service that may be expected before resigna- 
tion or retirement. The paper ends with details of proposed future studies into factors influencing labour stability. 











The Implications of the Highway Program as an Important Factor in the Long-Run Economic Outlook. L. Jay 

Arxinson, Department of Commerce 

The Highway Act of 1956 provides a long-term stepped-up spending program for highways, with emphasis on 
interstate network and increased Federai share. Spending, including State matching funds on Federal Aid projects 
was about $1.7 billion in 1956, will nearly double within a few years and gradually rise to about $4 billion in the 
final stages. All of the spending will not be “net additional” g. Toll road spending—now at a $1 billion 
annual rate—will drop and other projects in the planning stage may now w be financed from Interstate funds. Strictly 
pay-as-you-build Federal financing will reduce the inflationary impact of the road program and tend to limit State 
borrowing. Estimated trust fund receipts available for Interstate spending may be inadequate for satisfactory 
progress on the Interstate program after the preliminary planning stage is passed. They are estimated to provide 
for Interstate spending of about $1.5 billion annually for the years 1960-65. Short term direct effects will be on 
materials, capital equipment, and skilled labor for highway construction. Longer term effects are stimulation of 
automobile industry, trucking, petroleum, and “roadside” group. Partial offset is in diversion of traffic and in- 
vestment from other forms of transport, chiefly rail and transit. Interstate program will serve about one-fifth of 
traffic needs with fine roads, but other pressing traffic needs will be partly met by some increase in regular Federal 
aid and by rising State financing. For rural areas, traffic congestion is principally on interstate routes, and this 
will be largely relieved by the modern freeway system. More than half of the funds are allocated to urban areas. 
The urban roads will expedite a part of the city flow, but will attract more traffic. The net effect upon city traffic 
congestion will vary with size of city, transportation pattern, and other steps taken to relieve bottlenecks, especially 
the commuting jam. 








Probability of Runs in Correlated Data. Ratpa Horr Bacon, General Precision Laboratory, Inc. 


Consider a sample of n specimens drawn from a “normal” population or universe, where the successive drawings 
are not statistically independent, and where n is a large number (say 50 or 100). A striking feature of such samples 
is the presence of runs (or sequences) of specimens all on the same side of the sample average. It is of interest to 
try to obtain an expression for the probability of runs of length r as a function of the correlation coefficients between 
all the pairs of specimens constituting the sample. As yet, the rigorous solution has not been obtained, but an approx- 
imation, which cannot be in error by more than 10% in the worst case, will be described. 
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The Demand and Supply of Statisticians from the Standpoint of Colleges. T. A. Bancrorr, Iowa State College. 


The present and anticipated extreme shortage of able statisticians to fill academic positions in teaching, re- 
search, and consulting in statistics is, of course, part of the general growing dilemma in higher education, The 
over-all situation is spelled out in the recent report of the President's Committee on Education beyond the High 
School. Quoting from an editorial in the August 19, 1957 issue of the Washington Post, “The gist of this group's 
findings is that we are in an “explosive” period—population-wise and in the expansion of knowledge—and that the 
resources to meet the new demands for education are not at hand.” The discipline of statistics, however, is in a par- 
ticularly desperate situation due to the fact that it is academically a relatively new discipline as a separate field 
and at the same time finds itself in an “Explosive” period of development. 

Modern procedures of statistical inference found early acceptance and use in the biological and agricultural 
sciences. In fact, many of the basic concepts and much of the basic theory and methodology were developed to 
provide research methods and techniques in the agricultural and biological sciences. During and subsequent to 
World War II industry, public health and medicine have discovered the great importance of statistics as a tool of 
research and management. Competition for the few students being trained in statistics in the few well established 
university departments of statistics by government agencies, industry, and the universities has reached an alli 
time high and is mounting. Since the universities can be outbid even in competition for their own staffs, some 
devices must be found to retain able statisticians in academic positions to increase the supply of statisticians avail- 
able for all. University positions should be made more attractive by making salaries more comparable with those 
in industry. Industry and government itself could help support the statistical programs at universities by estab- 
lishing scholarships, fellowships, and supporting contractual research at the universities. The more intangible 
advartages of an academic career should be pointed out to able young statisticians, e.g. an intellectual environment, 
security, and freedom to engage in independent research. 


Selected Stock Market Indicators and Their Current Implications. J. E. Banxs, Brown Brothers, Harriman & Co. 


At the outset the limitations of any stock market indicators should be emphasized. Their value lies in their use 
as an aid in providing perspective for the longer-terrn investor. The relationship between stock prices and earnings 
is, of course, basic. This relationship may be ined by puting a regression between the Dow-Jones Industrial 
Average and the quarterly earnings on the stocks used to make up the Average. Current earnings may then be used 
to estimate a so-called “normal price.” If the level of the Dow-Jones Industrial Average is below the “normal 
price” and earnings are rising, a favorable background for a rising market is suggested. The reverse is also true. 
An index which may aid in anticipating earnings trends is the ratio of bank debits to bank loans. This ratio is 
analogous to a ratio of sales to inventories but moves earlier. The debit-loan ratio also performs a second function 
in that it reflects developing financial strain or ease which may affect stock prices. A collateral indicator of the 
financial situation is the ratio of bank investments to bank loans. Yield relationships are also helpful at times. A 
tatio of the Dow-Jones Industrial Average yield to the yield on Moody's Aaa Corporate Bond Average has on 
occasion suggested longer term buying zones. 


Regional Aspects of Cyclical Fluctuations. Gzorcs H. Borts, Brown University. 


The analysis of regional business fluctuations has always provided an enigma to economists. Some have main- 
tained that it is not worth while studying the cycle in regional slices of the economy; regions only mirror their 
composition of industrial or other economic characteristics. Others have dissented strongly in the belief that the 
regions have behavior patterns of their own which they impose upon the collection of national characteristics they 
happen to contain. I find myself thrust into this latter group by the findings of this study. The regions (in this case 
the 33 most important manufacturing states) do differ from each other in their cyclical behavior and differ in a way 
which the non-regionalists would deny was at all likely. On the other hand they differ in a way which the regional- 
ists have not conceived. These differences are the subject matter of this report. 

There are two int ting hypoth in the field of regional fluctuations: 

(a) One is that the cycle is transmitted to the region through a major, or export, industry. This means that 
the regional cycle reflects, or in some way, exaggerates the national industrial impulses transmitted to it. It is the 
reflection or exaggeration which constitutes the peculiarly regional element. 

(b) The other hypothesis is that the severity of the regional cycle is in some way tied up with the growth 
patterns that the region expericnces. One observer claims that cyclical severity is associated with growth. Another 
has claimed that cyclical severity is associated with economic decline. 

It is of course not possible because of the scarcity of data to provide complete tests of the two general hypoth- 
eses. The tests which I have been able to make indicate the following: 

(a) To the extent that the first hypothesis can be given content, it is rejected by the data in three out of the 
six cycles which I have studied. In addition it is rejected in a fashion which calls into question the entire hypothesis. 

(b) Cyclical severity is not regularly related to growth, or to decline. It does appear related to retardation. 
That is, states with the greatest severity have simultaneously experienced a deterioration in their rate of growth 
relative to other states. Why this relation should hold is not entirely clear. Earlier studies of cycles in industries 
with differing growth rates have established that cyclical declines became more severe as the industry matures. 
The evidence of this study indicates that this phenomenon may extend in a more general form to state industries 
as well. That is, the over-all drop plus rise becomes more severe as the rate of growth retards in individual states. 


Factors Affecting the Level of Fixed Investment in Manufacturing Industries for the Period, 1951-1955. Murray 
Brown and Herman Roseman, University of Pennsyleania. 








We have attempted to erplain the variation in fixed investment in manufacturing industries during the period, 
1951-1955. For this purpose certain well-known theories of investment behavior have been tested—namely, the 
accelerator hypothesis, the profits theory, and a sales expectations theory (in lieu of actual sales). An introduction 
to the paper reviews the findings of previous empirical studies of investment (Tinbergen, Klein, Chenery, Clark, 











SUMMARIES OF PAPERS 571 


Hickman, Modigliani, Koyck, Meyer and Kuh, and Norton), which served as a point of departure for the present 
study. Previous studies have presented conflicting reports, some indicating profits and some sales as the key ex- 
planatory variable. The present study compares these two variables (among others) as predictors and attempts to 
spell out some of the conditions under which one or the other is the better predictor. The data for this study isa 
subsample of 339 manufacturing firms taken from the SEC-OBE surveys of business investment. The SEC-OBE 
surveys provide data on firms’ fixed investment and expected and actual sales; this has been supplemented by data 
from Moody's on profits, capital stock, and various financial variables. Our sample has been broken up by year and 
by industry; each industry-year group was analyzed by linear multiple regression methods. 


How An Economic Consultant Prepares Presentations for Clients. Henry Bunn, Research Institute of America 


This talk centered on the fundamental differences between the presentation of statistical data for an individual 
company or client and that required in addressing a broad national audi or the bers of a single industry. 

In addressing its more than 30,000 members, the Research Institute is guided by these principles: the typical 
business executive today fully understands that his individual enterprise is closely tied to the future of the economy 
as a whole. But what he has learned to accept and look for are not facts but guidance, that is, interpretation of 
facts. What he wants is firm conclusions and recommendations, not a recital of data and reasoning which lead up to 
these conclusions. He expects to be alerted promptly to any significant shift in economic prospects, but does not 
want to be bothered by economic information when there has been no change. 

Tx contrast to this appeal to a broad national audience, is the approach which must be followed with a more 
narrowly confined group such as the members of a trade association. Legal limitations preclude forecasting which 
might, conceivably, have a direct influence on an individual company’s marketing or pricing decisions. As a result, 
much of an association’s work has to be retrospective, designed to put the industry’s performance in relation to the 
national economy and other industries. The most useful approach to this type of analysis lies in the presentation of 
industry-wide yardsticks by which each individual company can measure the effectiveness of its own operations 
with respect to each element of income and expense. 

The function of both nationally disseminated economic guidance and the specific yardsticks available to the 
members of a trade association is to provide the background and general framework within which the analysts of 
an individual company can more effectively present their own conclusions to top-management. 





Inventory Expectations Data. Ontw E. Bunter, University of Pennsylvania. 


It has been clear for some years that inventory investment constituted one of the most volatile types of busi- 
ness spending. Inventory status data frequently impart instability to the operations of the economy. The aim of 
this paper, however, is the consideration of the role played by inventory plans as related to understanding business 
fiuctuations. In other words, when information on inventories is to be used as data foreshadowing economic change, 
the more pertinent data may well be anticipation of inventory holdings rather than historical holdings. One way 
to develop such a measure is to obtain information on anticipated inventory change directly from business firms. 
The determination of accuracy and timing should begin with a study of the past performance of the data treated 
as a time series. Conclusive evidence would demand that the series have a responsibility to continue in the future 
in essentially the same pattern that it exhibited in the past. 

In its current experimental stage, the body of data available for consideration has no such full history. The 
experimental efforts of the Wharton School of Finance and Commerce study, the quarterly Fortune Magazine 
Survey of Inventory Plans, and the Dun and Bradstreet Survey of Business Men's Expectations do little more than 
suggest the potentiality of inventory plans as a forecasting medium. Nonetheless, on the basis of the implied effi- 
ciency of the data on inventory plans in forecasting the operations of the economy, the United States Department of 
Commerce has undertaken the collection of inventory plans data, on an adequate sample basis, which has consider- 
able promise of adding a significant lead series to business forecasting knowledge. 


Classification by Configuration and Level. E. I. Burpocx, New York State Department of Mental Hygiene. 


Classification of mental patients in terms of performance on a battery of psychological tests can be improved 
by reference both to configuration, i.e. pattern, of scores and to elevation, or level. If the test scores are drawn 
from a continuum with approximately normal distribution, the scores may be normalized. Differences in level may 
be tested by puting each subject’s mean score and comparing the means of the groups within the population 
by the usual analysis of variance methods. It is then possible to test for differences in configuration by ranking 
the tests for each subject, group by group, and transforming the ranks to corresponding normal! deviates, after which 
a test by subject analysis of variance may be carried out in which the mean square for the group X test interaction 
is compared to the error term. If this comparison yields a significant F-ratio, the groups can be said to differ in 
pattern. A three-way analysis may also be devised, as when the groups are divided among both sexes. The between- 
tests mean square may also be compared to the error term to see whether there is a significant over-all difference 
among the tests. However, this ranking method precludes an additional test of differences between groups, because 
the transformed scores necessarily sum to sero across subjects. 

The above described method was used to analyze the test scores of 78 schizophrenic patients at Brooklyn 
State Hospital. The subjects were all between 18 and 50 years of age; 40 were males, 38, f les. The subjects were 
divided into two groups, a chronic group which had spent more than 50°% of the period since first admission in the 
hospital, and a mobile group with less than 50% of the time in the hospital. Nine psychological testa were used for 
the analysis: 1. A verbal analogies test, 2. A verbal test of essential differences, 3. Critical flicker frequency, 4. 
Reading color words, 5. Identifying colors, 6. Naming conflicting color of ink in which color names are printed, 7. 
Reading color words printed in conflicting color, 8. Tapping time, 9. Reaction time. 

Results—Part I (Analysis of Means): F between groups = 4.584; F between sexes =0.665 N.S.; F sex X group 
interaction =0.455. Part II (Pattern of 9 tests): F between tests =0.31 N.S.; F test X group interaction =0,37 N.S,; 
F test X sex interaction =4,90; F test X sex X group interaction =0.42 N.S, 
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Truncation to Meet Requirements on Means. F. Evcene Cuark, Rutgers University. 


The problem presented is that of truncating a normal population with known mean and standard deviation 
in order that the truncated population will satisfy requirements on sample means. The mean of a random sample 
from a lot submitted for acceptance must lie between specified values or the lot will be rejected. The procedure given 
here is to assign points of truncation in such a way that the truncated lot will meet the specifications with a pre- 
scribed risk of rejection. Cases of both single and double truncation are presented. 


Anticipations Data in the Capital Goods Field. Morris Conen, National Industrial Conference Board. 


This paper represents a progress report on the new quarterly survey of capital appropriations conducted by 
the National Industrial Conference Board. The report sets the role of the appropriations survey in the postwar 
develop t of busi statistics and the parallel trends in private business planning. It distinguishes between 
the appropriations approach and the surveys of capital spending intentions, emphasizing that the capital appro- 
priation represents the company decision to spend for plant and equipment. It explains why the appropriations 
survey is limited to the 1,000 largest manufacturing companies, as measured by total assets. It outlines the battery 
of statistical indicators made available by the survey: appropriations newly approved, appropriation backlogs at 
the beginning and end of the quarter, appropriations that have been committed or spent, and cancellations. The 
published data for the ten quarters of 1955-1957 are analyzed briefly, the highlight being the apparent peak in 
new appropriations reached in the second quarter of 1956. Unpublished and preliminary data for 1953 and 1954 
are discussed, in particular the fact that new appropriation approvals rose during the latter part of 1954 and early 
1955 while capital spending declined. Finally, it is suggested that the appropriations series can serve as a short-term 
indicator, foreshadowing changes in manufacturers’ spending for plant and equipment and providing perhaps the 
first clue to changes in such spending which may occur beyond the end of the calendar year. 





Long Range Economic Outlook. Geraarp Com, National Planning Association, Washington, D. C. 


If reasonable full employment should be maintained, a Gross National Product of approximately $600 billion, 
measured in present prices, should be attainable in 1965. What are the existing forces operating in the economy 
which will bring about such economic expansion? 

During recent decades, the most important forces making for expansion have been the following: 1. The 
technological innovations with their impact on business investments; 2. The change in the pattern of living—the 
suburban development with all its consequences for private and public outlays; 3. The high level of government 
expenditures for national defense; 4. The spread of installment buying; 5. The changes in the distribution of in- 
comes—the rise in incomes of formerly low income groups, including the effect of old age pensions on consumer 
expenditures; 6. The rise in international trade activities. Some of these factors are now temporarily declining, 
others are rising or most likely to rise again in the near future. In addition, there is one group of demand factors 
which has been held back in the past but which is becoming of rapidly increasing importance. This refers to nonde- 
fense outlays of the government, such as for education, research, health, highways and skyways, conservation and 
developme nt of water and other natural resources. Considering each of these various demand factors, the long 
range out'ook for continued economic expansion appears most encouraging. There are, however, some factors 
which give reason for serious concern. 

1. The unsolved problems of price and cost rises suggests that a restrictive monetary and credit policy may be 
continued. This policy which is designed to counteract price rises actually dampens expansion. Adequate and 
continued economic expansion requires a constructive reconciliation of the policies of price stabilization and of 
promoting needed expansion. 2. Most of the needed nondefense government services to which I have referred 
are in the area of primary state and local responsibilities. However, the tax sources which expand most readily 
with general economic growth are in the hands of the Federal Government. Thus, there arises the possibility that 
etate and local tax resources may not be adequate to support the desired increase in government programs. This 
unsolved problem of Federal-state-locai fiscal relationship presents a serious obstacle to those development programs 
which are of great importance for continued economic expansion. 

3. Finally, there is a serious question about the future of the international economic outlook. The question 
of the dollar shortage which a few years ago appeared almost solved is coming up again. Continuing increase in 
exports will depend largely on a freer flow of capital from the capital-abundant to the capital-starved nations of 
the world. This flow is now impeded largely by political uncertainties. The conclusion is that the long-term economic 
outlook is most encouraging, provided that through public and private efforts we succeed in removing these serious 
roadblocks. 


Some Experience with Electronic Computers in Processing Economic Census Data. Maxwett R. Conky and 
Owen C. Gretron, Bureau of the Census. 


The paper discusses interesting and promising new developments in the use of electronic computers on economic 
census data by analyzing their application in the 1954 Manufactures Census. Both tabulation and mechanical 
editing phases of the processing are covered. In the tabulation end of the program, stress is laid on plant specializa- 
tion statistics which were published for the first time in any census. The unique ability of the computer to manipu- 
late data and compute ratios is described in detail. 

The aids provided by the computer in detecting erroneous data are discussed. This section of the paper also 
deals with the 1954 Census program of mechanically imputing data for small establishments that fail to report 
certain items or reported them in an inconsistent manner. The electronic computer enabled the Bureau to test 
reported product and materials data against acceptable unit value ranges mechanically for the first time with 
significant savings in both tirae and money. In this part of the talk relating to products and materials there is also 
an explanation of the technique of spreading the tabulated establishment data in unit value frequency distributions 
for review purposes. Finally, some of the limitations of the use of the computer in 1954 are outlined in a section 
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which also describes the Bureau’s plans for overcoming those deficiencies in the 1958 Censuses with the new sci- 
entific computer. The paper points out the vexing problem of introducing corrections into the computer tapes 
and related areas calling for improvement before the maximum advantages of the new equipment can be realized. 


Inconsistency of Preferences as a Measure of Psychological Distance. CLype H. Coomas, University of Michigan. 


The measurement of inconsistency has long been a basis for measurement of psychological distance, as, for 
example, with the jnd concept in psychophysics. This experiment is an intensive study of the relation of measures 
of inconsistency to psychological distance in the context of preferential choice as distinct from the context of psy- 
chophysical discrimination. A theory was tested which predicts that measures of inconsistency of preferential 
choice are not monotonically related to psychological distance unless a new condition, called laterality, is held 
constant. A series of twelve shades of grey were presented in sets of 4 at a time to each of 4 subjects who indicated 
their preferences among the stimuli as best representing their concept of “grey.” Each individual's concept of grey 
turned out to be an intermediate grey with lighter greys on one side and darker greys on the other. The hypothesis 
was tested that inconsistency between pairs of greys whose members were both on the same side of him would be 
of a different order of magnitude than inconsistency between pairs whose members came from opposite sides of 
him. Each subject constituted a separate experiment and the hypothesis was sustained for each of them. The 
significance of this for converting inconsistency measures into psychological distance measures was discussed. 


Modifications of the Correlation Technique for Use in a Process Capability Analysis. Davin Cowan, Hughes 
Aircraft Co. 


Industrial processes are subject to myriad sources of variation, each of which may or may not influence the 
end product dimension. The first step in establishing control of a process is to identify and measure the sources 
which do influence the end product dimension. Correlation techniques, which potentially can both identify and 
measure have rarely been used in process control studies. The intricate and tedious computations connected with 
the usual multiple correlation technique are partly responsible for this situation, but there is a more basic cause. 
The technique does not yield practical measures in terms of a process, and the intrivacies of the mathematical 
model do not lead to an insight into the physical relations among variables. The paper develops the statistic 


P=1-— v1-, 


where P is the per cent of variation in the end product dimension that is associated with a variation in processing» 
p is the population correlation coefficient. An alternative multiple correlation model is proposed which can take 
advantage of existing knowledge of relationships among sources of variation. 


Methods of Inquiry—The Income Approach. Daniet Creamer, National Industrial Conference Board. 


This paper outlines the problems in measurement encountered in determining the extent and socio-economic 
characteristics of low-income families in New York State in 1949. The data for analysis were the income distribu- 
tions of a one-per cent sample of families and unrelated individuals resident in New York State in April, 1950, 
and canvassed by the Bureau of the Census in the 1950 Census of Population. Attention is given to the development 
of an income norm that would mark off low incomes from higher incomes. The basis for setting up the norm is the 
standard urban workers budget necessary for the maintenance of health and decency. Its validity for our purpose 
is indicated by comparison with other budgets for other purposes. Adjustments to the reported income distribution 
were deemed necessary, particularly to allow for under-reporting of income and for improper classification of 
students living away from home. To come somewhat closer to a distribution of permanent income, an adjustment 
is made for the effect of cyclical unemployment on the 1949 income distribution. Applying our low-income yard- 
stick to the adjusted 1949 income distribution, we estimate that about 30 per cent of the combined total of families 
and unrelated individuals had low incomes in 1949 and about 56 per cent of these had incomes below the public 
assistance budget. 


A New Type of Estimate of the Income Elasticity of the Demand for Food. Jean Crockett, University of Pennsyl- 
vania. 


The well-known shortcomings of income elasticity estimates derived either from time series relationships or 
from Engel curves have led to a number of recent attempts to combine the two approaches in the hope of arriving 
at improved estimates. The present paper bases an estimate of the income elasticity for food on the changes over 
time in both food expenditures and income for a constant sample of about 1,500 families. Basic data were obtained 
from the records of the Market Research Corporation of America for the years 1951 and 1953. 

If the family demand for food in the year ¢ is a function of food expenditure, disposable income, family size, 
and other family characteristics affecting consumption, then an attempt to estimate the parameter relating to 
disposable income without the specific inclusion of family characteristics is likely to lead to biased results, since a 
fairly strong association exists between income level and family characteristics. However, if we consider a relatively 
short period of time, many of the family characteristics drop out since no change or negligible change occurs for 
most families in such factors as race, age, occupation, education, location, tenure of dwelling unit, and debt-asset 
position. Even when substantial changes in these variables do occur, it is not clear that they are highly correlated 
with income changes. 

It should be noted that the functional form used implicitly assumes that there are no interaction effects either 

iong the independent variables or between price and any of these variables (since the coefficients are constant 
over time rather than being functions of price). Prices are not necessarily the same for all families (e.g. they may 
be lower in small cities than in large) but are assumed to change in the same way for all so that changes in relative 
prices (as well as other factors acting uniformly on all families) are reflected in a constant term. 
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A shortcoming of the present approach lies in the fact that adjustments of consumption to large (and perhaps 
temporary) changes in income probably are not immediate and complete and may in fact be slower in some periods 
than others if the demand for other commodities such as housing and durables is particularly strong. 
For this reason it was found desirable to replace current year income with a 3-year average income and to omit 
certain cases of unusually high income change. While this raises the estimates of income elasticity obtained from 
the 1951-53 data, these estimates are still somewhat lower than those usually obtained in cross-sectional studies. 
This suggests either that sufficient allowance has not been made for slowness in adjusting food consumption to 
income change or that the ordinary cross-sectional estimates are somewhat too high. 





Accounting Records: A Limiting Factor in Economic Data Collection. Epwanp T. Crowpsr, Office of Statistical 
Standards. 


Economie statistics come to a large extent from business records. Survey planners often give insufficient 
attention to whether the facts they want are available in these records. Availability is not a simple matter, and un- 
less researchers and respondents understand one another as to what a survey entails, there may be unfortunate 
consequences. The problem of availability may be illustrated in various economic fields. The researchers can turn 
to no comprehensive reference literature on what business records contain. He may investigate experience in similar 
surveys, consult business men singly or in groups, and make mail and field surveys of sources and availability of 
data—procedures which should be more systematically used. Better communication and cooperation are needed 
between ecc ists and busi experts on accounting and record-keeping. When data are, in effect, not available, 
abandonment of the plan in whole or in part may be indicated. Fragmentary data from a universe are dangerous 
to use, and research is needed on statistically valid procedures when records are inadequate for the whole universe. 
Businesses may be persuaded to keep special records for surveys, but the long-run solution is voluntary adaptation 
of the accounting procedures to the needs of economists. This can and will be done if business men and the com- 
munity place a sufficiently high value on economic statistics. 





Demand for Farm Products at Retail and the Farm Level. Rex F. Daty, Department of Agriculture. 


This paper isolates the influence of price and quantity variation on expenditures for food, computes price 
and income elasticities of demand, and relates the supply response to price, quantity, and expenditure elasticities. 
Expenditures for food have varied about in proportion to changes in consumer income. However, flexibility of real 
expenditure relative to real income is around 0.45, about the same as for cross-section surveys. The bill for process- 
ing and marketing food, around 60 per cent of expenditures, is somewhat more responsiv. to income changes than 
is total consumer expenditures. Value of products at the farm level showed a flexibility relative to income of 0.9 
due mostly to variation in prices. However, income elasticity of quantity of food consumed at a farm equivalent 
level was only about 0.15—very unresponsive. A simple single equation demand analysis indicated price elasticity 
of demand for food at the farm level was about —0.i2 and income elasticity was 0.18. A knowledge of the supply 
response is necessary to indicate how price, quantity and expenditure for food change with a change in income. 
With a very low price elasticity of demand, the greater the supply response to relative prices the lower the income 
fiexibility of expenditure relative to income. 


Factorial Experiment. Curssert Danrex, New York. 


Fractional factorial 2°-¢ experiments, with p factors, in 2~¢ replication, are generally too large to run in one 
set. If all main effects and all two factor interactions (2A) are to be estimated, the numbers of runs required for p 
of 6, 8, 11, 15, are 32, 64, 128 and 256 respectively. The required fractional replicates are then 2*!, 28-2, 2'-* and 
24-7, Sub-fractions of each of these plans are given, requiring 16 runs, that provide estimates of large main effects, 
each confounded however with one or more 2f. Symmetric confounding patterns are given for these 2?-?-" sub- 
fractions and for the corresponding experimental conditions for each run. The steps of augmentation to two-factor- 
interaction-clear plans are indicated. 

Plotting of the e.c.d. of the collection of N-1 contrasts (main effects and interactions) produced by these 
experiments on a “half-normal” grid is proposed. Such plots facilitate detection of severe.l kinds of defects, espe- 
cially mavericks, inadvertent plot-splitting, and antilognormal error distribution, as well as aiding in the decision 
on what contrasts to pool for error-estimation. Also given are the operating characteristics of a “modulus ratio test” 
as found by A. Birnbaum and confirmed by some sampling experiments, 


A Three-Sample Zolmogoroff-Smirnoff Test. Hersert T. Davin, Iowa State College. 


A three-sample Kolmogoroff-Smirnoff statistic Ds, ,» =Max [sup { Fy, a(t) —Frx, n(t)}, sup { Pz, a(t) —Fy, a(t} 
sup | Fx, a(t) —Fz, n(t)}] is considered. The test based on Ds, n is especially powerful against alternatives of type 
X<Y<Z or ¥<Z<X or Z<X<Y. The distribution and asymptotic distribution of Ds, » are, respectively, 
Pr{Ds, »>I/n} = 

[nf] 
32 > (4) (nl*/(n — Dla4+)i(n + G — DDI, 
fmt f€J(i) 


lim Pr {n/2D3,, 2} =3 3 z (+)e [7], 
no i=l = jeJ(i) 


where the set /(i) consists of the integers (2—i, 3—i, 5—i, 6 -i, 8—i, 9—i, 11 —i, 12—i, .. . , 2—i), and where 
the (+) sign indicates that, for fixed i, successive terms in the finite series indexed by j have alternating signs, 
beginning with + for j=2—i, — for j =3—i, + for j «5 —i, ete. The distribution is obtained by solving a random 
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walk on a triangular lattice in the plane, with triangular absorbing boundary; the solution involves a reflection 
principle in the plane. 


Experiences with Response Surface Experimentation. Roszrt M. De Baun, American Cyanamid Company. 


About one-half the work of the experimental statistics staff of a large chemical research laboratory is provided 
by fitting data to empirical response surfaces of the Taylor series type. It is urged that such work is not applied 
effectively when conducting “post-mortems” on previously gathered data or when the mathematical model of the 
physical system is even approximately understood. It serves best when the mathematical model of the system is 
only poorly understood, when fairly precise experimentation can be conducted and when an approximation to the 
true functional relation may be useful, either for empirical prediction or as an aid in arriving at an understanding 
of the system. Two examples are given, both uf which show features characteristic of this type of experimentation. 
These are: multiple response variables, “ridginess,” i.e., the absence of well-defined point maxima or minima, and 
the presence of additional non-continuous independent variables, such as block effects and discrete (qualitative) 
variables. 


Tomorrow’s Investments. Harotp B. Dorsey. Argus Research Corporation. 


The investment manager today cannot formulate his decisions on tomorrow's investment policies without 
coping with the question of inflation. Investment analysts have been seeing many practical violations of the theories 
that are supposed to support the idea of inevitable inflation, and, therefore, those theories are beginning to lose 
adherents. Noting the weakness in the prices of copper, lead and zinc, and in the earnings and stock prices of the 
producers of those metals, the theories of the wage-price spiral do not seem to be working out in practice. Higher 
wages are not being offset by higher prices, and the stocks of these natural resource companies are not functioning 
as a hedge against inflation. Nor are the theories of inevitable inflation working out for the railroads, or the airlines, 
or for the producers of building materials, textiles, home appliances, certain lines of paper, and numerous other 
seg ts of the y. 

It is quite clear to the investment analyst that some of the foregoing industries are now suffering from an old- 
fashioned over-expansion. We may not be much further along in the inflation cycle than many people suspect. 
The current debate about the inevitability of inflation fails to give enough consideration to tbe counter-inflationary 
effect that rising prices have upon consumption. The work that is being done by my staff suggests very strongly 
that administered wages and prices have not been able to overcome the simple and basic economic fact that rising 
prices tend to curb demand and encourage supply. The current inflationary condit:on is not a new, mysterious, 
economic malady for which there is no cure, and, therefore, investment policies should not be geared to the pattern 
of persistent inflation, with its extreme emphasis on investment in equities. The pattern of the investment program 
which seems most logical would allow for the possibility that we are going to continue to have business cycles, 

On the basis of the fundamental economic forces, the investment manager should think in terms of carrying 
a balanced investment portfolio, i.e., balanced between good quality, fixed income securities and equities, and, in 
the latter group, his organization would be inclined to look with more favor on the defensive issues, until there is 
more assurance that the inflationary psychology of the last five years is not going to flop over to the deflationary side. 





Communicating Statistical Findings to Management. Stanrit Epmunps, Ford Motor Co. 


Economic analysis in the Ford Division provides part of the factual basis for planning on several specific 
operating problems. These are: production, product, financial, and facilities planning. The most continuous and 
exhaustive part of the statistical work is related to production planning, that is, planning production schedules 
throughout all the assembly plants and supplier divisions which will keep the output of cars in line with the retail 
market. The automobile industry operetes with a very low inventory position compared with most other businesses. 
The amount of stocks of cars between the end of the assembly line and the consumer's garage is generally not more 
than 30 to 40 days’ supply. This compares with an inventory of upwards of 60 days’ supply in most other lines of 
business. To maintain a continuous flow of automobiles with moderate inventories requires that the production 
lines operate at approximately the same rate as consumer buying of new cars. 

The cycle of product planning in the automobile industry is relatively long in that the product must be styled, 
engineered, and tooled as much as three years before actual production. The planning of a product which will 
appear on the market three years hence requires a close look at the kind, the number, the price ranges, and styling 
characteristics of the cars which are to be sold in the future. The relation of statistical and economic analysis to 
financial planning bas two aspects. The first is to establish a reasonable and conservative base upor which tooling 
facilities, and product costs can be projected into the future. This requires the establishment of a standard volume 
of cars which can be sold and over which the heavy overhead costs can be spread. The second aspect of financial 
planning to which economic analysis contributes is in the area of pricing. Here the problem is to determine the 
volume of cars which can be sold at various price intervals. This helps determine the amount of cost which can be 
incurred end built into the product in relation to what the consumer wants and is willing to pay. The longer term 
projections of market conditions help to establish, also, the production facilities which will be required in the future. 
Hence, long range market projections help establish the expansion and investment policies which are appropriate 
to keep the pany ina titive position in relation to the total size of the market. 


y 








Bank Credit and Inventory Cycles. Donts M. Eisemann, Rand Corporation. 


Changes in the availability or cost of bank credit may influence variations in business inventories. The propor- 
tion of inventories financed by bank loans increased during a cyclical inventory expansion and decreased during 
the contraction. At the peak of inventory accumulation, in the first quarter of 1951, almost one-third of manufac- 
turers’ inventories was financed through short-term bank borrowings. The correlation bet the lly ad- 
justed quarter to quarter percentage changes in loans and inventories for manufacturing between 1947 and 1955 
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was .91. During the three periods of credit restraint cince 1951, the loan to asset ratio of small firms generally de- 
clined while that of large firms increased. As large firms also had a low correlation between changes in loans and in- 
ventories, we concluded that the level of their inventories would not be influenced by changes in the credit policy 
of banks. Inventories of small and medium sized firms were found to be sensitive to such changes. 

Industries with large inventories in relation to their assets, and with a high ratio of loans to inventories, had 
the closest link between changes in loans and changes in inventories. Presumably, a change in monetary policy 
would influence the level of inventories of these industries more strongly than the inventories of other industries. 
For the period as a whole, the average magnitude of seasonal fluctuations was considerably larger than that of cy- 
clical changes, for both loans and inventories. This points up one of the difficulties of using monetary policy to 
mitigate inventory cycles. 


Karl Pearson: An Appreciation of One Aspect of His Work. Caurcaitt Eisennanrt, National Bureau of Standards. 


An attempt is made to contrast briefly and as sharply as possible the differences in aim of Biometry on the 
one hand, and of Adjustment of Observations on the other. In a nutshell, the distinction is this. In Adjustment of 
Observations, variation is not a fundamental characteristic of the natural phenomena under study but creeps in 
through the imperfections of our instruments and our senses, and is thus something unwanted that has to be lived 
with. The aim of Adjustment of Observations is to minimize its influence. On the other hand, in Biometry, variation 
is recognized as an inherent characteristic of natural (at least tiological and social) phenomena, and is neither to 
be ignored nor suppressed, and the aim of Biometry is to give proper expression to it and to measure it. Whereas 
the arithmetic of the Method of Least Squares, as developed by Legendre and Gauss, and the arithmetic of Cor- 
relation and Regression, as developed by Galton and Pearson, have much in common, the objectives of the two 
approaches are quite different; and Pearson’s work is not merely an elaboration on the word of Gauss, but a tre- 
mendous step in an entirely new direction. 





Presentation of Statistics to the Management of a Large Chemical Company. Ina T. Extis, Z. J. du Pont de Nemours 
and Company. 


The functions of the Economist's Office in du Pont are to study and interpret major business trends and de- 
velopments. We report to the Executive Committee, the top operating committee of the Company. Our staff 
includes four men who are specialists in selected fields of economic research. One is an expert on commodity price 
trends; another is a specialist in the fields of economic theory and foreign trade. A third is a specialist in growth 
rates of particular industries. He is particularly familiar with the construction industry and population trends. 
Our fourth man has a a pues in chemical market research. 

Our principal method of « i ic information to management is through a meeting one hour 
in length on the third Tuesday of each month with the members of the executive and finance committees. Topics 
to be discussed are presented in the form of a report distributed the previous Friday. The entire hour is used for 
discussion. There is usually no formal presentation other than the report itself. The general theme about which 
our discussion subjects cluster is those factors that affect or are iated with ic expansion—the broad 
area of responsibility of these men—rather than the day-to-day activities of the Company. In addition to the 
above specific duties, we also prepare estimates of the probable business development one, or two, or ten yeurs ahead, 
for use by the manufacturing departments in preparing sales forecasts covering various periods. Our estimates 
are usually restricted to such measures as the Federal Reserve Board Index of Industrial Production, indices of 
wholesale prices, and the consumer price index, wage rates in manufacturing industries, etc. These are usually 
transmitted in tabular form covering reported data for the past five years, together with estimates for the required 
future years. 

While we prepare a variety of measures covering the entire economy, we also prepare a sales price index repre- 
senting all our sales and one for each manufacturing department. We also prepare indices of “quantity” sales. Our 
commodity specialist maintains an index of market prices covering all the principal raw materials purchased by 
the Company. We also initiate special studies at irregular intervals, which are usually issued as formal reports. 








Methods of Estimating Inventories of Passenger Cars. Witu1am C. Fianerry, Chrysler Corporation. 


The discussion of dealers’ inventories includes the methods of preparing estimates by Chrysler Corporation, 
by the automobile industry, and by groups outside the industry. In addition, consideration is given to the useful- 
ness, reliability, and comprehensiveness of the estimates. The treatment of consumers inventories is based on the 
number and age distribution of cars in use. Attention is paid to the usefulness of estimates for both short run and 
long run policies. Considerable care is paid to techniques of estimating future inventories both quantitatively and 
qualitatively. Mention is made of specific problems such as the impact of shifting scrappage rates for different 
year models. 


Avoiding Nonsense Correlations in Time Series. Ricuarp Foorn, Connell Rice and Commission Company. 


Nonsense correlations in economic analysis are those which reflect an historical accident rather than a basic 
economic cause. They may be dangerous for use in forecasting because the accidental relations that engender them 
can cease to exist at any time. More reliable forecasts usually are obtained from analyses that are based on the 
structural equations that are believed to make up an economic system, or their reduced form counterparts, even 
though such equations may give a lower coefficient of multiple correlation than can be obtained for other analyses 
during the period covered. Before using a study to make forecasts, charts should be made that illustrate the partial 
relationships between the dependent variable and each of the independent variables and the unexplained residuals 
should be plotted against time to determine whether they exhibit a trend or some other departure from randomness. 
Ideas obtained from an examination of such charts sometimes suggest sound modifications that need to be made 
in the analyses. 
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Interrelationships between Investment and Sales Anticipations. Murray F. Foss, Department of Commerce. 


Part I deals with the realization of business investment anticipations in 1955—a year which is of special interest 
in the study of factors affecting investment since it was a turning point year and one which saw a larger-than- 
average discrepancy between business investment expectations and results. The paper presents the results, cross- 
classified with respect to several variables, of a special questionnaire sent to manufacturing firms requesting ex- 
planations for deviations from 1955 plant and equipment expenditure anticipations as reported in the Office of 
Business Economics-Securities and Exchange Commission survey for that year. Results of relationships between 
investment deviations and deviations in sales and profits, both actual and anticipated, are also discussed. 

Part II deals with factors affecting the investment anticipation (as distinct from actual investment) through 
a cross-sectional analysis of manufacturing firms. The paper presents, for the years 1954-57, relationships between 
expected investment, on the one hand, and ex post and ex ante variables on the other hand, such as sales, profits 
and depreciation. 


Supply and Demand for Statisticians in Federal Service. Davin L. Furransxy, Bureau of the Census. 


Discussion is restricted to those positions officially designated as Statistician. It will exclude all other groups 
such as Social Science, Economic, Mathematics, which may be applying and developing statistical methods but 
whose duties are not considered to be primarily in the field of statistics as defined in the Federal Classification sys- 
tem. In the past 10 years, the number of statisticians in the Federal service has increased from 1700 to 2100, with 
most of the increase taking place between 1947 and 1951. They are divided roughly as follows: Analytical Statis- 
tician—62%; Survey Statistician—15%; Statistician—15%; Mathematical Statistician—8%. (Mathematicians 
have increased during the same 10-year period from approximateiy 500 to 2000, with a 50% increase from 1954 to 
1957.) In the nation-wide survey (June 1957), agencies reported a total of approximately 300 vacancies for which 
provision had been made in their budgets. Of these, at least 100 can be considered in the entrance grades of GS-5 
and GS-7. Approximately 40 to 50 of the vacancies are for Mathematical Statisticians. The entrance grades of GS-5 
and GS-7 ($3670 and $4525 per year) are covered by the Federal Service Entrance Examination (the old Junior 
Professional Assistant examination extended to include grade GS-7). This examination has not been able to provide 
enough available Statistician eligibles to meet agencies’ needs. 

The grades GS-9 ($5440 per year) and above are covered by unassembied examinations (rating based on re- 
view of education and experience). The registers for the Washington D. C. area are practically depleted of Mathe- 
matical Statisticians—less than 10 eligibles for all grades. Those on the registers for Analytical and Survey are 
mostly persons already working in the Federal service—of the approximately 320 persons on the registers, 75% 
are in the Federal service. The examination, therefore, acts more as an internal promotion service than as a source 
for bringing new people into the Federal service. Within the Analytical and Survey groups there is only a smattering 
of persons with subject-matter backgrounds in such fields as Demography, Health and Medicine, Physical and Bio- 
logical Sciences. 

The other big problem appears to be one of quality. In the past year some 215 certificates were issued in grades 
GS8-9 to 15, of which about 25% resulted in an appointment. Apparently in many cases the certificates were returned 
unused because of the calibre of persons available for appointment. 


How Results of Statistical and Economic Research are Presented to the Board of Governors of the Federal Reserve 
System. Frank R. Garrietp, Board of Governors. 


One of the means of communication now used at the Board but not on display in Board publications is a 
visual-auditory pr tation—“the show.” This technique has been used since 1950 with Federal Reserve and a few 
other groups, domestic and international. “Each show,” as noted in one of a dozen descriptive paragraphs, “has 
its own pattern, designed to fit the circumstances of the time and the particular audience, with due consideration 
also of what information, both new and true, may happen to have become available recently, from the ticker tape 
or a report on some research project. Whatever the pattern, a try is made to convey a sense of progression from 
introduction right through to conclusion—without being too dogmatic or too persistent about a particular theme. 
The natural inclination of responsible writers to avoid too firm commitments helps to maintain a scholarly tone of 
uncertainty. At the same time recognition of the ity for decision making and of staff responsibility for sifting 
out technical perplexities does tend to bring definiteness into the picture.” 








The Low Income Problem: Basic Concepts. Merepira B. Givens, Interdepartmental Committee on Low Incomes, 
New York State. 


The extent and causes of low incomes and depressed levels of living, which persist even in the midst of pros- 
perity, are subjects of special current interest for research, social inquiry, and public policy. However, even in this 
statistical age we do not know with exactitude the absolute or relative size of the low income population. Perhaps 
of even greater importance we do not have a single, ly ted definition of low income. Administration 
and operations of the numerous public welfare and income security gregrems are producing a wealth of statistical 
detail about the recipients of each program. However, these materials are only of partial aid in developing a com- 
prehensive picture of the numbers and characteristics of families and individuals at the lower end of the income 
distribution. 

Ciearly, a more complete description of the economic position and living conditions of population groups can 
be built from analysis which scrutinizes individual families. A primary source of such details is the decennial census. 
Unfortunately the decennial surveys do not provide any basis for other than national estimates. New York, like 
other states, is therefore handicapped in evaluation of current programs and in ascertaining the extent and type of 
unmet needs. An Interdepartmental Committee in New York State is taking action to fill in the statistical gaps 
in our knowledge of the size and characteristics of low i population. Its studies include (a) estimates of in- 
come distribution in relation to economic status, (b) a statewide survey of selected factors related to level of living, 
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(ec) a survey of characteristics of publie assistance recipients and their households, (d) an ineome-expenditure 
survey among households of unemployment insurance recipients, (e) analysis of standards of need in use by public 
agencies, and (f) study of problems of low income farming. Findings are to be used in framing an action program. 


Application of Analysis of Variance Techniques to Profile Analysis. Samce, W. Greennovse and Seymour Geisser, 
National Institute of Mental Health. 


This paper does not consider the difficult problem of selection to form profile groups but rather the analysis 
of two questions often asked of group profiles for groups already formed on the basis of some external criteria. 
These two questions are: 1. Are the group profiles the same with regard to level? and 2. Are they the same with 
regard to shape? 

Assume & tests (having the same metric) given to each of np individuals in the pth group (p=1, . . . , g) yield- 
ing the following vector of means in the pth group: (#1p, . . . frp). Under classical analysis of variance assumptions, 
the most relevant of which is the independence (or at most equal correlations) among the & tests, it is intuitively 
shown that the appropriate statistic to measure departure from parallelism of the group profiles is the group means 
x test interaction. The appropriate error for this statistic is the individual z test interaction resulting in an F test 
with (k—1)(g—1) and (k —1)(N —g) degrees of freedom respectively, NV = Dnp. 

However, in almost all problems arising in psychology, the assumption of independence is violated and one 
must assume a sampling of vectors from a k-variate normal distribution. Based on an extension by Geisser and 
Greenhouse of results due to Box, the above test can be made approximately in the tabular F-distribution with 
(k—1)(¢ —1) €and (k —1)(N —g)€ degrees of freedom where ¢ is a function of the el ts of the variance-covariance 
matrix. A conservative test is an F with (g—1) and (N —g) degrees of freedom. The test on levels presents no dif- 
ficulty and can be made with an F-test with (g—1) and (N —g) degrees of freedom. Problems of unequal covariance 
matrices and other relevant aspects are also discussed. 





Mass Data Input to Computers, a Progress Report on FOSDIC. M. L. Greenovan, National Bureau of Standards. 
For a ber of lica ti the effectiveness of electronic computers is limited by the effort required for 


transcription of source data to a form suitable for computer input. Automatic equipment for converting data from 
marked documents to magnetic tape has recently been employed at the Bureau of the Census on a number of 
projects. Developed at the National Bureau of Standards, the equipment is called FOSDIC (for Film Optical 
Sensing Device for Input to Computers). In this instrument, microfilms of marked questionnaires are examined bv 
an electronic, flying-spot scanner, the output of which is recorded on magnetic tape. FOSDIC is characterized by 
a high degree of tolerance for practical operating conditions, including provision for use of ordinary pencils or pens 
for sensing “dominant” marks and for freedom of d t size and layout. In the paper, highlights of the present 
FOSDIC and new ones in process will be given. In addition to these mark sensing instruments, the recently com- 
pleted, punched-card film reader will be described briefly. 


Distinctive Characteristics of Psychometric Statisti ‘Probl of Metric. Harnotp Guiiixsen, Educational 


Testing Service. 











The metrics used to report test scores and other psychological “measurements” are open to many serious ques- 
tions. Characteristics of various metrics used in the physical sciences are discussed with particular reference to the 
“addition experiment,” from which these metrics derive their most valuable properties. There are analogous “in- 
ternal consistency experiments” which provide similar valuable properties fur certain psychological ements. 
An illustration is given from the psychophysical method of paired comparisons, based on the law of comparative 
judgment. Multidimensional scaling methods and ability measurements method are also discussed. 








Statistics for Engineers at N. C. State College. R. J. Haver, North Carolina State College. 


Interest in statistics is growing very rapidly among engineering students at North Carolina State College 
Recruiters from industry have been helpful in promoting this interest. Another factor has been the existence of a 
number of industry-sponsored senior year scholarships for engineers who agree to take additional work in statistics. 
At present one undergraduate course and several beginning graduate courses are available to engineers. The first 
course covers random variation, probability, a number of basic distributions, confidence intervals, comparison of 
two averages, tests of significance, simple linear regression and quality control. Quality control is also taught in 
several other departments. Attempt is made to emphasize concepts rather than develop skill in application. No 
mathematics beyond college algebra is used. The three graduate courses most popular with engineers have been 
Applied Mathematical Statistics, Basic Statistical Analysis, and Sample Survey Methods. There is also consider- 
able interest in the IBM 650 which is located in the Statistics department. 


Some Experiments with Demand Relationships for Clothing. Monris Hamsura, University of Pennsyleania. 


This paper reports some of the exploratory work in a study of demand relationships for clothing. The coverage 
is restricted to clothing expenditures as a whole although the completed study will concern itself with sub-groups 
and individual commodities as well. While the work that has been done and is reported in this paper is preliminary, 
the completed study will include a detailed analysis of the effects of a large number of social, economic, and demo- 
graphic variables on clothing expenditures particularly as determined from the 1950 B.L.S.-Wharton budget 
study. 

The firet type of demand relationship for clothing that is derived is a least squares time series multiplo regres- 
sion equation relating per capita deflated clothing expenditures, per capita deflated disposable income and the 
relative price of clothing. Various transformations of the variables are made including first differences, logarithms, 
and first differences in the logarithms. A simple complete structural system is fitted to the same set of time series 
data, utilizing a marginal propensity to consume determined from the B.L.S.-Wharton budget study. Other at- 
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tempts are also made at the integration of cross-section and time series data for clothing expenditures. In all of the 
above-mentioned experiments, the emphasis is primarily upon the income and price sensitivity p ters i 
Finally, certain aspects of one explanation of the decline in clothing expenditures in the post war period which 
relies heavily upon time series observations are tested with the B.L.S.-Wharton data. The major point of interest 
in these tests is whether, for urban families in 1950, low levels of clothing expenditures tended to be associated with 
high levels of durable goods purchases. 





The Influence of the Interviewer on the Accuracy of Survey Results. R. H. Hanson and E. 8. Marxs, National 
Analysts, Inc. 


During the 1950 Decennial Census, the Bureau of the Census conducted an experiment to study the effects 
of interviewer variation. This experiment involved over 900 enumerators covering 1800 enumeration districts 
(ED’s) with a total enumerated population of 980,000 persons in 24 counties in Ohio and Michigan. Assignments of 
ED’s to enumerators were made at random within 134 geographic areas to control the effect of variation in the actual 
respondent ebaracteristics and the interaction between respondent and interviewer characteristics. 

Interviewer variance was analyzed for 100 E.D. characteristics, including the usual Census characteristics 
(age, race, income, etc.) plus some special measures related to quality of performance (number of improperly omitted 
entries, difference of obtained from expected sampling proportions). As measured by the F-test, about 4 of the 100 
characteristics showed interviewer contributions to variance significant at the .01 level (at least one-third of these 
significant at the .001 level). 

All of the measures of improperly omitted entries show interviewer variance contributions significant at the 
.001 level. Factors entering into significant interviewer contributions to variance appear to be (1) interviewer 
“resistance” to a given question—i.e., a tendency to omit or alter the question and to assume the answer; (2) rela- 
tively high ambiguity, “subjectivity,” or complexity in the concept or wording of an inquiry; (3) the degree to which 
additional questioning (“probing”) tends to alter initial respondent replies. 

Also investigated was the relationship of interviewer performance to interviewer characteristics as given by 
(1) seores on a number of tests; (2) by such personal characteristics as age, sex, occupation, education; and (3) by 
attitudes toward, and expectations of, resp it reactions. At least one measure of poor quality of enumeration 
(number of improperly omitted census entries) seems to show substantial correlation with age, with some test 


+ 


scores, and with expectations about resp pera ti 








The Demand and Supply of Statisticians from a Survey of Industries and Colleges. Bovp Hansunarcrr, Virginia 
Polytechnic Institute. 


This survey covered a number of the large industries in America as well as various government agencies and 
colleges. The survey may be summarized in the following quotation from one of the respondents: 

“There is a great shortage of well-trained statisticians who are taught how to use the theory. We could produce 
five times as many M.S. and Pb.D. level statisticians and not swamp the market in ten years. The recognized need 
is being expanded much more rapidly than production. An engineer whe also has statistics training can be assured 
of $1,000 higher starting salary. An equally qualified statistician can easily secure $100 per month more than the 
same level engineer.” 


Maximum Likelihood Estimation from Sampling Data. H. O. Hartixy, Iowa State College. 


When estimating statistical parameters one is often faced with “incomplete data” in the sense that some of 
the observations in the random sample drawn are not available for the computation of the estimators. For example, 
if a time mortality study is terminated at a fixed point in time ¢*, then the exact length of life will be known only 
for those experimental! units dying before f*, while for deaths occurring after f*, only the number of deaths will be 
known. The estimation of the mean length of life from such incomplete data is known as “estimation from censored 
data.” 

If the number of deaths occurring after f*, is not known either, we have what is known as a “truncated sample.” 

These are the best known irstances of incomplete data and are contained as special cases in the more general 
situation dealt with in this paper, in which censorship and/or truncation may occur for any subset of the variate 
range. The computation of maximum likelihood estimators from such incomplete data is simplified and unified 
by a method which represents a considerable improvement on those in current use. It is in fact shown that maximum 
likelihood estimation from incomplete data can be reduced to that from a complete sample. The method is akin to 
the “missing plot technique” in analysis of variance, and it is applicable in any situation in which a maximum 
likelihood procedure for the complete sample is available. 


The 1955 Survey of Life Insurance Ownership. ALsert I. Hermawin, [Institute of Life Insurance. 


The purpose of this paper is to analyze the economic and demographic factors which are related to life insurance 
ownership and the attitudes and motivations which affect life insurance behavior. The data are drawn largely from 
the 1955 Survey of Life Insurance Ownership, a sample survey of the population conducted for the Institute of Life 
Insurance by the Survey Research Center of the University of Michigan, although other studies of life insurance 
ownership are utilized. 

The effects of income, age, place of residence, and education on life insurance ownership are set forth. Two 
patterns of life insurance behavior in the family are postulated—the tendency to have small amounts of life in- 
surance on all family members versus the tendency to concentrate relatively large amounte on the family bread- 
winner. The income and educational characteristics associated with each of these patterns are presented. The man- 
ner in which the income and demographic factors are limited in their ability to differentiate large owners from 
small owners of life insurance is also discussed. 
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The importance of certain attitudes and motivations on life insurance ownership is pointed out, through 
findings relating the reasons people give for owning life insurance and people's feelings about the adequacy of their 
coverage to life insurance ownership. Differences in ownership apparent in cartain demographic groups are seen, 
at least partly as differences in attitudes of these demographic groups. Possibilities for further research are sug- 
gested. 
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On the Problems of Coverage in Samples of Dwelling Units. Inenz Huss, University of Michigan. 


An investigation of noncoverage of dwelling units in Survey Research Center surveys is presented. Formerly, 
noncoverage in our surveys was in the neighborhood of 10 per cent. Thus it was roughly the same magnitude as 
the total of all nonresponses, and its reduction was most important. During this period of low coverage, the dwelling 
unit addresses for SRC samples were selected from prelisted blocks and supplemented by the “half-open interval” 
technique at the time of interviewing. An analysis of the reasons for noncoverage showed that it was high in low 
economic blocks, in medium-size cities, and in both the slow growing and the fast growing places. In less than one 
year after new procedures were introduced into about 60 per cent of the sample, our coverage increased to approxi- 
mately 97 per cent of the number of occupied dwelling units reported by the Bureau of the Census. These procedures 
are: (1) samples of city directory addresses supplemented by an area sample; (2) samples of small compact segments. 
Particularly gratifying is the success of city directory samples in the medium-size cities where coverage had been 
lowest. Now noncoverage, as a source of errors of nonobservation in our surveys, has been greatly reduced in mag- 
nitude and importance. 


An Experiment with Weighted Indexes of Cyclical Diffusion. Bert G. Hickman, The Brookings Institution. 


Diffusion indexes measure the extent of common participation of the various sectors of the economy in expan- 
sions and contractions of aggregate activity, and are used to study the relationships between the degree of partici- 
pation and other characteristics of business cycles, such as turning points and amplitudes. The usual index takes 
account only of the direction of change of the components, so that if in a group of 100 time series, 80 were increasing 
and 20 decreasing in a given month, the value of the index would be 80 per cent. An experiment was undertaken to 
determine if the behavior of a diffusion index of employment would be affected when it was weighted to allow for 
the size of the component industries, the amplitudes of the individual fluctuations, or the bined infi of 
size and amplitude. Size weighting made little difference, because the average size of industries in the expanding 
and contracting groups remained fairly constaut despite substantial fluctuations over time in the number of in- 
dustries contained in the two groups. Amplitude weighting exaggerated the monthly movements of the unweighted 
index, because the forces which made for an increase of diffusion also tended both to amplify the average increases 
in rising industries and to dampen the average declines in falling industries. Some suggestions are offered about the 
use of diffusion indexes in the search for the causes of business cycles. 





The Design of an Experiment to Measure the Effectiveness and Costs of Programs of Services for Aging Persons. 
Juxuros A. Jann, Community Service Society of New York. 


The existence of relatively high sates of unemployment, low income, illness, disability, sub-standard housing, 
social isolation, p ti and similar “problems” among aged p as pared to younger 
persons has become a matter of common knowledge and concern in many communities. A great variety and number 
of programs have been undertaken or advocated to “help older persons” affected by such problems, and large ex- 
penditures of time, money and other resources have been made and would be required to carry out such programs. 

Since resources for such expenditures are limited, a “good principle” to adopt in making decisions concerning 
which programs should be developed and administered is to select from all possible programs that one or few which 
will be “most effective” in overcoming specified “problems” of aged persons within given limits of “cost.” The 
determination of the “effectiveness” and “costs” of possible programs involve hypotheses that are subject to errors, 
so that in practice the programs selected may not actually be the “most effective” for given costs. However, by 
the use of acceptably designed experimental research methods, some of these errors can be eliminated and others 
can be reduced to specified limits of sise and probability of occurrence. 

Accordingly, the Community Service Society of New York City has been concerned not only with the develop- 
ment and administration of health and welfare services for older persons in New York City, but also with planning 
and carrying out experimental research to measure the “effectiveness” and “costs” of a number of alternative 
programs. The design for such an experiment is presented in this report. 








An Appraisal of Productivity Projections. James W. Knowzzs, Joint Economic Committee, United States Congress. 


The tests suggested in a previous paper are applied to the appraisal of productivity projections, of which 25 
are summarized in tabular form. First, were the projected quantities the best estimates which could be derived? 
This could not be answered with confidence, except in a minority of cases. Second, would other assumptions have 
been more efficient or appropriate for the particular purpose? Given the structures of the mode, in most cases 
changes in the productivity projections would not have improved the over-all economic projections, nor would 
mere changes in assumptions have improved substantially the productivity projections. Third, in most “full em- 
ployment” projections, the structures of the models were not optimum for the purposes leading to their construc- 
tion. Two reasons for deficiencies in these productivity projections were found. Inadequacies of both data and 
empirical research into the basic relationships between inputs, outputs, prices, and factor returns hamper the makers 
of such projections. Also, many mistakes in such projections can be traced to failure to start with a clear and precise 
statement of the objectives or purposes of the projections. 











SUMMARIES OF PAPERS 581 


Farm Income Estimates—Methods and Meaning. Naruan M. Korrsxy, Agricultural Marketing Service, USDA. 


The Department publishes regularly estimates of gross farm income, production expenses and net income of 
farm operators. Net income is shown for the U. 8. and by States on a realized basis and on a net income basis 
including the value of inventory change. In addition, the Department publishes income of farm people including 
income from nonfarm sources and comparisons of incomes of farm and nonfarm persons. The estimates of gross 
farm income are based largely on information collected by the Crop Reporting Board supplemented by the Census 
of Agriculture every 5 years. However, the information on production expenses and on nonfarm income received 
by farm people has been relatively weak. These have been reinforced by a recent nationwide survey of farmers’ 
expenditures and off-farm income. For the future, estimates of farm income by economic class of farm are being 
developed, particularly the break between commercial farms and noncommercial farms. There is also the continuing 
problem of how to assess differences in incomes between farm and nonfarm people. This involves the valuation of 
nonmoney income received from the farm, the return to farm investment, differences in the cost of living as between 
farm and city, relative skills and other factors. 


The Contribution of Cross-Sectional Survey Data to Prediction of Aggregative Components of Consumption. 
Heten Humes Lamatre, U. S. Bureau of Labor Statistics. 


Only a few years ago tion functi for diff. t types of goods and services were called the “wall 
flowers” of economic aaalyses, while search for the consumption function centered in time series analyses. The 
interrelatedness of time series and cross-sectional survey data and the need for understanding the component parts 
of aggregate consumption have been recognized. However, consumption functions derived from time series analyses 
and cross-section analyses differ. It is in the explanation of these differences that cross-sectional survey data make 
their greatest contribution to prediction of aggregate consumption. 

The cross-section studies provide detailed information for various components of aggregate consumption in 
relation to the demographic and economic characteristics of the consumer. These data, when properly analyzed, 
will provide many of the links to time series analyses needed for predictive purposes. Proper analysis, however, 
involves consideration of differences in ts, definiti and classifications between the time series and cross- 
sectional data. Some of these differences are discussed. Thus, the cross-sectional data provide the raw ore from which 
prediction tools may be derived but only after careful refining and processing. 








Median Age at First Marriage in the United States. Emanvext Lanpav, California Department of Public Health and 
Patience Lavuriat, Bureau of the Census. 


The medians discussed in this paper were calculated from decennial census data on marital status by single 
years of age. The tables (to be distributed) present estimates for 1940 and 1950 of median age at first marriage for 
States by sex, residence groups, regions, and. selected States by color and sex. 

The most striking characteristic of the rapid decline during the 1940's in median age at first marriage is the 
significant lessening of differentials between the various groupe in the population. Generally, the groups with the 
highest ages at marriage showed the greatest decli The decline between 1940 and 1950 for white persons was 
about one and one-half years, whereas the decline for nonwhites was less than half a year. The decrease for urban 
areas was almost twice as great as that for rural areas, for both whites and nonwhites. 

The Northeast, the region with the highest age at marriage for both males and females in 1940, shows the great- 
est decline during the 1940 decade, reflecting this region's large proportion of population living in urban territory. 
In each region, with the exception of the West, where in 1940 the median age at first marriage for nonwhites was 
higher than for whites, the decline among white persons was almost twice as large as that among nonwhites. In 
the West, the greater decline among nonwhites reflects the sharp change between 1940 and 1950 in the racial com- 
position of that region’s nonwhite population. 

The tendency for groups with the highest ages at marriage in 1940 to decline the most between 1940 and 1950 
is also evident in the data for States. Furthermore, the data indicate a striking similarity between States in the 
difference between the median age at marriage for males and females in both 1940 and 1950. The data for selected 
States also show a more rapid decline for whites than nonwhites, as noted above. 

A brief discussion of the method used in puting these medi is included, together with a short statement 
regarding differences between the various median ages at first marriage which are found in the literature. 











A Moéel for Failure Data. ANpre G. Laurent, Wayne State University. 


Given a “stimulus” S with cumulative effect X and a “System” )_, the response of the system can be described 
by the “X-to failure” probability distribution. The exponential type of failure which implies the constancy of the 
expected life of the system and is valid when there is no ageing process and death is accidental, has been thoroughly 
studied. A model is proposed which is characterized by an exponential law of decay for the expected life or “resistance 
to failure” of the system, namely Ez = Epo exp ( --aX). To estimate the parameters of the model, least squares, max- 
imum likelihood, moments methods, and methods based on ordered statistics are studied. The cases of truncated 
and censored samples are considered. 


Use of Sample Survey Data and Analysis of Variance as a Guide to Packaging and Pricing Policies. Dick A. Lzano, 

University of Michigan. 

The purpose of this paper is to demonstrate the usefulness of the application of the techniques of analysis of 
variance to sample data as a guide to packaging and pricing policies. While the data employed are hypothetical, 
actual market information may easily be substituted and the procedure remains the same. Therefore, while the 
conclusions based on these hypothetical data may not be true of a given market situation, the technique is still 
valid. 
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Evaluating Influences on Satisfaction with Patient Care in Hospitals through Multipie Regression Techniques. 
Everne Levine and Benjamin Bucusinper, Public Health Service. 


While the number of professional nurses in hospitals has grown to 265,000 and the number of non-professional 
or auxiliary personnel has reached nearly 400,000 there continues to be an acute and growing shortage of nursing 
services. However, evidence has been piling up to the effect that while a shortage of nursing services may exist, it 
is not entirely due to a shortage of nursing personnel. Some hospitals with high daily average hours of nursing care 
available per patient complain as loudly about being understaffed as hospitals with low hours. It seems as though 
factors other than the number of nursing personnel, such as poor utilization of nurses’ time and high personnel 
turnover, generate feelings of a shortage of services and consequent dissatisfaction with nursing care. The ratio of 
professional to iliary personnel on the nursing unit and the size and type of contro! of the hospital are other 
possible influences on the amount of nursing services provided and the level of satisfaction. 

In the spring of 1956, the Public Health Service, in cooperation with the American Hospital Association, 
launched a study in 60 general hospitals to explore the relationships bet the ber of hours of professional 
and auxiliary nursing care available per patient per day, the size and type of control of the hospital, and the feelings 
of shortages of nursing services, from the viewpoints of the patients, doctors and nursing personnel. Checklists 
were developed on which 20,000 patients and p 1 recorded actual omissions of important components of 
nursing care. The items were weighted by their importance, and average total scores of satisfaction with nursing 
care were computed for each type of respondent in each hospita!. A separate multiple regression analysis was carried 
se In the model, the dependent variable was this average total score of satisfaction 
expressed as a functi ters representing the amounts of professional and auxiliary nursing 
fr sam a en nn Rott 

The findings of the study indicate that more can be done to alleviate the shortage a eerviess and the con- 
comitant dissatisfaction, than merely recruiting more personnel. For example, p tion is related to 
the number of hours of professional care available, but not to the number of total (professional plus non-profes- 
sional) hours available. This is a strong indication that not just care by any nursing personnel is important to the 
patient, but that the skills and knowledge of the professional nurse enable her to understand the patient’s problems 
and therefore give care which transcends merely meeting the immediate physical needs. 


The Role of Industrial Classification in the Employment Security Program. Louis Levine, Bureau of Employmen 
Security, U. S. Department of Labor. 


The Bureau of Employment Security and the affiliated State employment security agencies have need for a 
wide range of data classified by industry for use in day-to-day operations, administrative planning, and economic 
analyses. The Federal-State employment security system encompasses progr for job pl t and employment 
counseling, stabilizing employment, minimising unemployment, reducing labor turnover and providing income 
maintenance through partial compensation of wage loss resulting from involuntary unemployment. 

Employment and earnings of workers covered by unemployment insurance laws, insured unemployment, 
estimates of total employment, labor turnover, employer estimates of future labor requirements, job openings 
listed by employing establishments, and workers placed in jobs by the public employment offices illustrate some 
of the types of data developed and utilized in the employment security system. Effective use of these data make it 
necessary that they be classified by the State agencies uniformly by industry. These data serve important purposes 
for others, both within and outside government, for program, marketing and general economic analysis. The revised 
Standard Industrial Classification system will make these data more significant and more useful to the system and 
to other users by facilitating comparison and coordination with data from other sources similarly classified. 


Variations in Mortality from Heart Disease. Epwanp A. Lew, Metropolitan Life Insurance Company. 


The paper presents the variations in the recorded mortality from arteriosclerotic heart disease, including 
coronary disease, in the United States with respect to age, sex, race, marital status, geographic region, socio-economic 
status, occupation, and a number of other factors. Corresponding data for Great Britain and other countries are 
also cited where available. It ; opears that many factors, seemingly unrelated, affect the death rate from arterio- 
sclerotic heart disease. If this 1s true, one should guard against the tendency to over-simplify and over-generalize 
in accounting for the variations in the incidence of the disease. The hypothesis of multiple causation appears to 
be most consistent with the known facts. The paper also calls attention to some of the gross biases that exist in 
the recorded mortality statistics for degenerative heart disease. Some of the difficulties inherent in the interpretation 
of such statistics may be overcome if greater uniformity can be secured in the terminology used in reporting heart 
disease on death certificates and if analyses of mortality from heart disease are extended to give consideration to 
multiple causes of death and to histories of morbidity. 























Table for the Determination of Two-Sided Tolerance Limits for the Norma! Distribution. ALrrep LizsrrMan, 
Dept. of the Navy. 


A solution for the computation of tolerance limits for the normal distribution has been given by Wald and 
Wolfowitz. Based on these results Bowker has prepared a table of factors K such that the probability is 7 that at 
least a proportion P of the distribution will be included between ¥+ Ks, where ¥ and are estimates of the mean 
and the standard deviation computed from a sample of N. 

The table prepared here is an extension of Bowker’s table. Bowker has determined K factors for means based 
on samples of size N and standard deviations based on N —1 degrees of freedom. The present table gives K factors 
for means based on samples of size N and standard deviations based on n degrees of freedom. A special case of the 
present table is the case when n = ~1, in which case the present table gives the same results as those given by 
Bowker. 
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The K factors are given for y =0.75, 0.00, 0.95, and 0.99; for P =0.75, 0.90, 0.95, 0.99 and 0.999; for values of 
N from 1 to 10, 20, 30, 50, 100 and @; and for values of n from 1 to 30, 40, 50, 100 and «. 

The primary additional use of the present table over those already existent is that it can be used in conjunction 
with experimental designs, where the number of degrees of freedom determining the standard deviation is usually 
not N —1. Another application of the present table is in the setting of tolerance intervals for regressions where the 
number of degrees of freedom is not N —1. 


The Contribution of Panel Studies to Forecasting the Automobile Market. Cuartes Linincer, Eva MveLuEr, and 
Hans Wyss, University of Michigan. 


Periodic surveys, each conducted with a new representative sample of consumers, indicate a close relationship 
between changes in aggregate car buying plans and changes in total sales—with only occasional exceptions. Re- 
interviews of the same people are a powerful method whereby forecasts can be checked on an indivudual basis 
as weil. At the same time they enable us to assess the predictive value of additional variables which may be thought 
to influence automobile demand. They can also show what intervening changes are responsible for non-fulfillment 
of plans or unplanned purchases. In addition, panel studies may provide purchase data for the same families over 
an extended period of time. 

The principle source of data for this paper is a panel study by the Survey Research Center financed by the 
Ford Foundation. It consists of an original interview in June 1954 with 1150 urban families and 4 subsequent 
reinterviews, the last one in February 1957. In addition the paper draws on data from reinterviews with the Survey 
of Consumer Finances sample in 1949 and 1953. 

One part of this paper presents deta on the pattern oi automobile buying ever a 34 year period within groups 
differing by age of the family car. Car purchase plans, their fulfillment, as well as unplanned purchases also are 
examined within these groups. A second part of this paper compares fulfillment of car buying plans in relation to 
certain characteristics of the plan—plans to buy new cars vs. plans to buy used cars, plans expressed with certainty 
vs. plans that are uncertain, short-run vs. long-run plans, planned cash purchases vs. planned credit purchases, 
and so forth. The third part investigates the contribution which attitudes other than buying plans may make to 
forecasting automobile purchases. 


Research Methods for Analyzing Short-Term Demand for Consumer Durables-—Housefurnishings and Equip- 
ment. Vernon G. Lippitt, General Electric Company 


This paper deals with the question: What research methods show most promise of yielding an understanding 
of consumer demand for housefurnishings and equipment—an understanding adequate for forecasting aggregate 
demand for these items reasonably accurately for twelve months into the future? 

Research methods discussed include: extrapolation; correlation of aggregate time series to predict aggregate 
demand; use of aggregate time series to predict segments of total demand; surveys of intentions to buy; use of 
cross-section data to determine relation between buying behavior and household characteristics; combining evi- 
dence from cross-section data and time series, at the aggregate level or through repeated cross-sections or reinter- 
view data. Problems of aggregating the effects of family characteristics are discussed. Suggested areas for further 
research effort include: more analysis of existing cross-section data, probably using analysis of variance approach; 
further efforts to combine evidence on effects of family characteristics and market conditions, especially through 
repeated surveys or reinterviews; investigation of the stability or trends of demand relations through time; experi- 
mentation with Orcutt’s model involving a sample from the universe of consumer units and direct aggregation of 
buying behavior; securing more survey data on family expenditures by major categories, including longitudinal 
studies of given consumer units over time. 


Some Statistical Problems in Urban Renewal. Bensamin Lipsretn, Market Planning Corporation. 


The planner’s needs include both bench mark data on the status of the area and information on trends. The 
sample survey of population and housing is capable of producing the basic bench mark data with a high degree of 
reliability and simultaneous information on trend movements for the population that resides in the area. There 
are, however, a number of unique and difficult problems to be considered in conducting surveys in urban renewal 
areas. (1) The standard definitions should be examined for their applicability to local conditions. (2) Consideration 
must be given to proper public relations, lest refusal rates make the study useless. Both the size of the sample and 
the questionnaire mechanism must be geared for adequate control of interviewer variability and application of 
definitions which are more complicated than those usually found in consumer and market studies. 


Automatic Programming as a Means of Enabling Statistical Applications of Computers. Mrrcneti O. Locks, 
Sperry Rand Corp. 


The statistician encounters some difficulty in using automatic high speed digital! computers. To program an 
application, it is y to cc a great deal of time in applying an intricate system of codes. Automatic 
programming systems can reduce the barrier between the statistician and his computer. The most advanced type of 
automatic programming is the compiler. Under a compiler system, pseudo-codes are used instead of machine codes. 
Each pseudo-code refers to a complete subroutine in a library of subroutines. By using a compiler’s simple and 
mnemonic pseudo-codes, the application can be programmed in a small fraction of the time that it can be directly 
in machine codes. The empirical portion of this paper deals with a case study in using the A-2 compiler system 
of UNIVAC I in coding a General Purpose One-Way Analysis of Variance. After verification of formulas, analysis, 
and flow charting it took two hours to code this problem in 26 steps of pseudo-code. This resulted in 600 machine 
instructions. Compiling time (i.e., the amount of time taken by the machine to write its own program) was about 
one minute. Running time was negligible on some test problems. 
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Distinctive Characteristics of Psychometric Statisti Probl Arising from Errors of Measurement. Freperic 
M. Lorp, Educational Testing Service. 


Since observed scores on mental tests contain sizable random errors of measurement, the basic problem of 
psychometric theory is to obtain valid inferences about true scores from the scores actually observed. Quite satis- 
factory assumptions about the frequency distributions of the errors of measurement are available, and the psycho- 
metrician can obtain experimental data from which the parameters of these distributions can be estimated rather 
accurately. Unsolved (or inadequately solved) but very basic problems include: (a) estimating the true score of 
each individual in the group tested from his observed score; (b) estimating the shape of the frequency distribution 
of true scores for the group tested; (c) testing the hypothesis that two different tests measure the same thing, i.e., 
that their true scores have a perfect curvilinear correlation; (d) estimating the shape of the functional relation 
between true scores on two tests, or of their regression curves. Each problem is outlined in mathematical form, 
together with the kind of data available and a possible set of appropriate assumptions. 





The Evidence Relating Diet to Coronary Heart Disease. Groner V. Mann, Harvard University. 


The origins of the hypothesis relating the human dietary . coronary heart disease (CHD) date back to the 
eighteenth century. The relationship became of theoretical importance in the 1850s when Virchow and Rokitansky 
debated the pathogenesis of the fatty, arterial lesions they observed. The dietary causation of atherosclerosis was 
emphasised by the production of atheroma in rabbits by dietary regimens. All subsequent experi tal athero- 
sclerosis has been accomplished by increasing the serum cholesterol level and this has generally been done by 
dietary means. 

We can not conduct the critical human experiments that would be the direct approach. These are impossible 
for ethical reasons and also because of the chronicity of the process we are studying. It seems important to relate 
the usual diet to the level of serum lipids, to atherosclerosis—an anatomical process—and to the behavior of CHD 
—a aH } ph 

There are four ways to evaluate dietary intakes (1) food disappearance data gathered by economists (2) family 
surveys (3) individual dietary records (4) personal interview estimates. The techniques and the limitations of each 
will be discussed. 

There are no direct methods for evaluating any of these in an absolute manner. The indirect ways consist of 
(1) correlated evaluation of subjects’ physical status (2) comparisons of energy intake measured by the dietary 
appraisal with estimated energy expenditure (3) small scale samples of food weighing data. Experience with these 
methods will be discussed. 

The enticing prospect of a quick and simple dietary evaluation by application of a short questionnaire designed 
to measure the intake of a few “key” foods has not been successful because the evidence for identifying these key 
foods—if such exist—has not been obtained. 


Some Probability Considerations in the Estimation of Computer Running Time. Ex1 8S. Marxs and Bensamin J. 
Terrie, National Analysts, Inc. 


When the operations performed by a computer are carried out serially the estimation of computer running 
time is quite straightforward, running time being the sum of the time taken by the various individual operations, 
each of which may be estimated separately. However, one of the major features of modern high speed digital com- 
puters is that they are designed to read, write, and compute simultaneously or nearly so. Where the computer 
program involves handling relatively large volumes of input and output with a moderate amount of manipulation 
of the data internally, the problem of estimating total time required may become difficult. 

One way of estimating time in such circumstances is to program the problem in detail and observe the actual 
running time. This may not be possible since (a) the equipment proposed may not exist at the time that the estimates 
are required; (b) it may be very costly and time consuming actually to obtain fully tested programs. The procedure 
described in this paper is first to construct a program which is not necessarily complete or accurate in all details, 
but is sufficiently so as to provide reasonable estimates of the time required for the basic operations that are involved 
in the program. A probability model is then set up, on the basis of assumptions regarding the distribution of the 
input by “type,” the latter being defined by the basic operations that are required to handle the input record. On 
the basis of the probability model it is possible to estimate the means and variances of the delays that are encoun- 
tered, that is, the intervals of time during which operations are interrupted because other operations must be com- 
pleted first. This leads directly to estimates of the expectations and variances of the elapsed time for handling 
records drawn at random from a specified statistical population. An application is described in which the relative 
efficiencies of two computers are compared, fee a given file maintenance problem. For one of the computers, the 
application involved “double-progr 

















Greenhouse and Laboratory Experiments. Cant E. Manrsnaty, Oklahoma State University. 


The sources of variation, limitation of facilities, and the techniques of caring for experiments cause greenhouse 
and laboratory experiments to be as hazardous as any other type of experimentation. He who feels that the variation 
in this type of experimentation is so small it can be ignored, should experience an appropriately designed uniformity 
trial. A critical look at greenh and laboratory experiments reveals that although the experimenters are in- 
terested in testing hypotheses about central tendencies and measures of variation, they have considerable interest 
in the areas of response curves and estimation. 

Present day experimenters are discarding the one factor approach in favor of designs which will accommodate 
the more natural conditions, whero many factors vary simultaneously and frequently not independently. Factorial 
arr ts of treat: ts are proving quite valuable. The use of the Latin Square, the Youden Square, and other 
balanced and partially balanced incomplete block designs are increasing the efficiency of greenhouse and laboratory 
experiments. This type of experimentation usually implies a simulation of conditions or situations which are ex- 
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pected to be experienced on a much larger scale. The statistical inference must of necessity be made to the popula- 
tion of experimental units and any transfer of inference to any other population of interest must be based on judge- 
ment, logic, or intuition. An exploratory experiment should be interpreted as such. Frequently it will be advanta- 
geous to disregard the principle of fixed sample size and turn to the multi-step approach of experimentation. This 
sequential type of experimentation is especially useful in laboratory experiments. 

The statisticians should be urged to publish articles concerning statistical techniques in the language of, and 
through the media of, the various subject matter fields. One method open to all is to work for joint publications 
in the fields which demand their attention. 


The Predictive Value of the National Bureau’s Leading Indicators. Franx E. Morais, Investment Bankers Associa- 
tion of America. 


The National Bureau's statistical indicators were not intended to constitute a complete basis for forecasting. 
In themselves, they provide the foundation for only a partial analysis of cyclical trends. The user must modify 
the results given by the indicators in the light of other information available to him. This fact complicates any 
attempt to make an objective appraisal of the predictive value of the indicators during the postwar years. 

In general, the lead-lag relationships which prevailed during the prewar years have continued to prevail during 
the postwar years, despite important structural changes in the economy. A competent analyst, using the National 
Bureau indicators, should have been able to produce reasonably accurate forecasts of the recessions and revivals 
of 1948-49 and 1953-54. It is possible (although certainly not inevitable) that he may also have predicted fluctua- 
tions which never occurred (at least on any significant scale). The user of these indicators during the postwar years 
may have made mistakes in forecasting the timing of cyclical turning points and in judging the amplitude of swings 
in the economy. He should not, however, have made mistakes with regard to the direction of movement of the 
economy. 

A number of difficult problems are encountered in applying the indicators to forecasting on a current basis. 
The most important of these are: (1) erratic movements in the basic data, which cannot be entirely eliminated by 
smoothing devices; (2) substantial differences in lead times from cycle to cycle; and (3) the fact that the leading 
series is responsive to leveling-off periods or minor cyclical movements as well as to major cyclical trends. 

The difficulties caused by erratic movements in the basic data can be reduced by introducing a new dimension 
in obtaining a consensus of the indicators. The consensus measure most commonly used is the “percentage ex- 
panding” index, which merely records the direction of movement of the indicators. Another measure in use is the 
“duration of run” index, which weighs the direction of movement of the indicators by the number of consecutive 
months that the movement has occurred. This produces a smoother index, but one which occasionally lags behind 
the “percentage expanding” at cyclical turning points. Neither of these commonly-used indexes refiects the rate 
of change in the indicators. 

Experiments with consensus measures which give weight to the rate of change in the indicators show that 
indexes reflecting this information possess some significant advantages. One such index, which might be called the 
“duration-rate” index, gives equal weight to the duration of run and the rate of change in the indicators. This 
“duration-rate” index shows considerably less erratic movement than either the “percentage expanding” or the 
“duration of run,” and it does not lag behind the “percentage expanding” index at cyclical turning points. 

The ability of the “duration-rate” index to bring out cyclical trends more clearly without loss of sensitivity 
rests on the fact that shifts in the direction of movement of the indicators are usually preceded by a slackening 
in the rate of change in the indicators. 

The postwar performance of the National Bureau indicators would seem to demonstrate their usefulness as a 
tool in the forecaster’s lamentably inadequate kit. There is much room for further development of this type of 
approach. In particular there is need for: (1) a broadening of the coverage of the leading indicators and (2) the 
establishment of some theoretical linkages between movements of the indicators and movements of the aggregates 
to be forecast. 


Field Experiments. Davip D. Mason, North Carolina State College. 


A range of problems in field experiments, both of a practical nature and theoretical nature, are posed and 
discussed. Particular attention will be given to problems in planning field experiments for technical information 
in agronomic, horticultural, and agricultural engineering. Presentation and illustrative examples of problems in 
sampling the population of conditions over which inferences are to be made, choice of treatments, plot size, and 
sampling schemes will be given. Some problems in the use of supplementary measurements for adjustment of data 
by covariance or other means will be discussed and illustrated by example. 


Industrial Activity and Productivity. Miron Moss, Federal Reserve Board. 


Averages of only year-to-year changes may at times obscure the underlying relations between changes in 
output, capital investments, labor productivity, wages, and prices. To maintain a monthly output index, manhours 
have served as the best alternative to measurement of areas that are particularly difficult to measure, where the 
production process is long and products are continually changing in complexity and are particularly difficult to 
standardize. Manhours and output are shown to have very similar monthly movements for the 110 series in the 
Federal Reserve production index based on physical product data. Also, annual changes in output per manhour 
have been broadly similar for the output series based on monthly product data and the output series based on ad- 
justed manhours. The Federal Reserve is undertaking to obtain monthly data on use of electric power by type of 
industry group. Such data should help improve the accuracy of monthly production estimates in those areas where 
adequate physical quantity data are lacking and where output determinations are inevitably difficult, particularly 
in the metal fabricating industries. The making available of thly power figures for important industry cate- 
gories should go far in improving our understanding of relations between electricity use on the one hand, and out- 
put, manhours and other inputs, on the other hand, 
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Sample Surveys as a Tool for Life Insurance Research. Eva Mux.imr, University of Michigan. 


Among the research tools at the disposal of students of the life insurance market, the two most widely used 
are analyses of policies and sample surveys of the population. The great advantage of an analysis of policies is that 
one starts with company records and therefore has plete and accurate knowledge of the characteristics of the 
policy. The chief limitation of an analysis of policies is that it provides little information about the characteristics 
of owners and buyers and no information at all about non-owners and non-buyers. Hence it is argued that for most 
research concerned with the dynamics of the life insurance market, or with life insurance purchases as an aspect of 
consumer behavior, personal interview surveys are the more suitable research tool. 

The validity of personal interview surveys regarding life insurance depends on sample design and sample sise 
as well as on interviewing and questionnaire design. Insurance questions must be worded in terms which are mean- 
ingful to respondents. And the questions should not go beyond people's knowledge about life insurance in general 
and their own policies in particular. The potentialities and limitations of life insurance surveys are illustrated by 
discussing the methods of a survey of Life Insurance Ownership conducted in 1955 by the Survey Research Center 
of the University of Michigan for the Institute of Life Insurance. 





Evaluating Case History Data at Time of Admission to a Juvenile Training Institution. Paut Mue.ier, Depart- 
ment of Institutions, Olympia, Washington. 


For treatment assignment and assessment purposes a behavioral classificatory technique was developed for 
use with institutionalised juvenile delinquents’ case histories. Narrative definitions, augmented by specific ex- 
amples, define polar traits forming six behavioral dimensions: Aggression, Dominance, Non-Conformity, Gang 
Behavior, Anxiety, and Withdrawal from Peeples. Dimensional selinbiities for independent ratings on seven-point 
scales range from .56 to .88. Three types: Socialized, U ialized Aggressive, and Emotionally-Disturbed Delin- 
quents, are defined by “Ideal Type” behavioral profiles, wherein each type is assigned an “ideal” rating on each 
seale. Behavioral ratings for thirty delinquent boys were transformed into slope indices, i.e., comparisons of gradi- 
ents between all points, and each boy classified as the delinquent type with whose “ideal slope index” his slope 
index had the greatest number of agreements. Use of this technique on the ratings of independent judges resulted 
in identical classifications in 83.3 per cent of the cases. 

Further sub-classification, by means of a Personal Control score, based on ratings of Aggression, Dominance, 
and Non-Conformity, discriminated on a group basis between boys assigned to maximum and minimum security 
treatment facilities and boys who receive more or fewer severe disciplinary actions. Combined Slope Index and 
Personal Control score classification showed that minimum security assignments were clearly related to specific 
delinquent sub-types. Larger sample studies would determine reproducability of findings and rates of success for 
different types within varying treatment facilities. 








Implications of Pension Fund Accumulations. Viro Natre.ua, Securities and Exchange Commission. 


This paper presents new projections of the financial aspects of private pension funds based on trends and 
developments of the last few years. In 1956 contributions to private non-insured pension funds amounted to $2.3 
billion. It appears likely that contributions in 1960 and 1965 will amount to $3.2 billion and $4.4 billion, respec- 
tively. The projections imply that total assets of non-insured pension funds, which amounted to $16.6 billion at 
the end of last year, might increase to $29 billion by the end of 1960 and $52 billion by 1965. At the end of 1956, 
52 per cent of the assets of non-insured pension funds were invested in corporate bonds, 26 per cent in common 
and preferred stock, and about 14 per cent in U. 8. Government securities. 

For the period 1951-1956 as a whole, private non-insured pension funds accounted for 17 per cent of indi- 
viduals’ saving. Non-insured pension funds provided about one-fourth of the security financing required by corpora- 
tions in 1956. They accounted for over one-third of the new capital provided by all financial intermediaries for 
corporate expansion. It appears likely that the pension fund share in new financing will become even more important 
in the future. It does not appear from present indications that pension fund accumulations will present an investment 
problem in the foreseeable future under conditions of dynamic economic activity and a high level of employment. 


Summary on Housing. Rosinson Newcomse, Rodi: Ni bA iat. 


The paper points out that for many purposes careful local market analyses are more important than national! 
market analyses. The overall market may be up, while many local markets are down. Therefore even companies 
and mortgages interested in national markets must depend in the last analysis on local studies. Work is progressing 
in many centers, as Miami, Los Angeles, San Francisco, Denver, Seattle, etc. These studies provide continuous 
current information on production, price, and sale of new, and the price and sale of old houses, by sections of the 
city. In addition they provide data on vacancies, employment, income, population changes, etc., which enable 
builders, dealers, and mortgagees to keep unusually well informed of changing trends. Unfortunately not enough 
of this is being done in enough cities. 

The market for the country as a whole varies in the short run with changes in 8 major phenomena: marriages, 
aging of the population as a whole, deaths and divorces, income and its distribution, per cent of income going to 
housing and commuting costs, style, quality and quantity of housing offered, price and terms, and interest rates. 

Internal migration of population by itself is not listed as a major factor in the housing market, as it is not 
very important to either short- or long-term markets as a whole, as few areas lose population as a result of migration. 
Migration influences the location of the market more than its quantity. 

These 8 factors may be grouped into four categories: (1) Reshuffling of the existing population. For instance, 
at some periods a high proportion of adults have taken residential quarters. From 1950-1955 there was a net growth 
of nearly 2 million new households other than husband-wife households, as compared with a net growth of just 
over 2 million in the number of married couple households. The ratio of growth in other than husband-wife to 
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husband-wife households was 3.5 times the ratio for the period 1947-1950, (2) Changing social patterns as in in- 
comes, and in the use of these incomes, Autos were more important than housing in the 20's; housing is more im- 
portant than autos today. (3) The housing and its environment offered the market. Two-bedroom houses were not 
popular in 1950. One-bathroom houses are not popular in 1956. New houses must keep abreast of changing tastes. 
(4) The price of the house. Housing prices rose precipitously from 1953-1955. Interest rates rose too. The size of 
the down payment, the length of the mortgage, and other terms were modified. A close watch must be kept on these 
four major factors, and on the influences behind these factors, such as the attitude of government officials, of 
mortgagees, of builders, and of families. If these attitudes can be forecast, this effect can be approximated, with 
luck, and usable prognostications made of the volume of home building to be anticipated. 


A Program for Development of Crop and Livestock Estimates. Srmrtina R. Newew, U. S. Department of Agricul- 
ture. 


The Agricultural Estimates Division, AMS, is a major source of agricultural statistics. The Division has 
recognized the need for improvement in accuracy as well as scope of its statistics in esder to serve our growing 
economy properly. For the past four years, a research program has been conducted to inv ible improve- 
ments in the ry of estimates and forecasts by probability sampling and objective measurements. Results 
indicate that this type of approach, used in conjunction with current procedures, would provide sizable improve- 
ments in the statistics produced. 

This research by the Division has laid the foundation for a recently developed long range plan for strengthening 
its statistics. When fully implemented, this plan calls for enumerative surveys made in June, October, and December 
of each year to estimate acreages, livestock numbers, and farm labor. About 15,000 sample segments would be 
enumerated so that sampling errors for major crops and other items would be two per cent or less. Yield surveys 
in August, September, and October, and monihly surveys of prices would be made on subsamples from the June 
Survey. This plan would require the present budget of the Division to be almost doubled. However, the improve- 
ment in services rendered should more than justify the increased cost. 








Labor Market Experience After Exhaustion of Unemployment Benefits. Leo M. Onwicz, Bureau of Employment 
Security, United States Department of Labor. 


Some 19 State Employment Security Agencies throughout the country have participated with the Bureau 
of Employment Security in the conduct of studies examining the personal characteristics and labor market experi- 
ence of claimants who have exhausted their unemployment benefit rights. How long do such claimants remain 
unemployed after exhaustion? Do large proportions of them leave the labor force within two months—four months 
—after receiving final payment? What relationship, if any, existe between age, sex, industry or other characteristics 
and reemployment experience? The answers to these and similar questions are sought through these studies. Three 
aspects of the project will be covered by the paper: (a) the basic objectives of this type of research, (b) description 
of the methodology, including improvements over earlier techniques as well as limitations, and (c) a report on the 
study findings and their significance. 


Ranking in Triple Comparisons. Rosert Nixon PenperaGrass, Radford College. 


General extensions of the probability model for paired comparisons, which was developed by R. A. Bradley 
and M. E. Terry, are considered. The mathematical model recommended for triple comparisons is based upon the 
assumption of the existence of true treatment ratings, or parameters. A general class of tests of the null hypothesis 
that treatment ratings are equal is discussed, and maximum-likelihood estimators of the treatment parameters 
are obtained. These estimators are used in the evaluation of test statistics that are derived as functions of the likeli- 
hood ratio. The null hypothesis that treatment ratings are equal is tested against two special alternatives. One 
alternative hypothesis makes no of eq y of treatment ratings. The other alternative hypothesis 
makes the assumption that there are only two groups of treatments and within group treatments do not differ in 
ratings but the two groups themselves may have different ratings. The asymptotic distributions of test statistics 
are discussed and approximate tests by means of the chi-square distribution are recommended. When compared 
with the analysis of variance, the relative efficiency of the method of triple comparisons is approximately 72%. 
When compared with paired comparisons, the relative efficiency of triple comparisons is 150%. 


The School Leaver in the World of Work. Marcarer L, PLunxert, Bureau of Labor Statistics. 


The Department of Labor, Bureau of Labor Statistics, has completed a survey of youth work experience in 
Clarksburg, West Virginia (Harrison County), and has studies in progress in five other widely separated areas most 
of which have experienced labor surpluses in recent years. Youth as defined in these studies are secondary school 
graduates who do not go on to college and students who drop out of secondary school before graduating. The purpose 
of these surveys is to find out what adjustments these young people make to the labor force and the relationship, 
if any, to school background. The method used in Harrison County was to get from the echool records, basic data 
about all school leavers from September 1951 to May 1955, such as sex, age at leaving school, I.Q.'s, highest grade 
completed, reason for leaving, and number of vocational courses completed. From this universe, a 25 per cent sample 
was drawn for personal interview. From each school leaver who was still in the county in the summer of 1956 the 
interviewer obtained information on his reasons for leaving school, post-school training, if any, his employment and 
unemployment experience, and the type of jobs obtained. It was hoped that these data would throw light on why 
the unemployment rate of young people is so much higher than that of older age groups. If the school leaver himself 
was not still in the county, briefer interviews were held with a family ber or a neighb 

By the summer of 1956, nearly half of all the school leavers had left the county. Porty-Sour per cent of all the 
male school leavers were in military service at that date. Of all those who remained in the county (both boys and 
girls), and who regarded themselves as in the labor force, 95 per cent had found regular jobs at some time, after 
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leaving school, but as of the summer of 1956, 13 per cent were unemployed. Graduates started to look for work 
sooner than dropouts and a much higher proportion of them found jobs within a week after starting to look. The 
boy graduates were most likely to get their first job in some type of semi-skilled work and the boy dropouts in 
unskilled work. Girl graduates were most likely to become salesclerks or typists and stenographers right after leaving 
school, while the girl dropouts found first jobs most often as waitresses. 

There was a greater difference between earnings of boys and girls in the summer of 1956 than there was between 
graduates and dropouts. A much higher proportion of boys than of girls earned $70 or more a week, and boy drop- 
outs earned more than girl graduates. The girl dropouts had the most irregular employment pattern of all, and al- 
most two-thirds of them were earning less than $30 per week in the summer of 1956. 





Methods of Estimating Retail and Consumer Inventories. H. R. Powe.u, Jewel Tea Co. 


The food chain industry is rather unique among major American industries to the extent that consumer in- 
ventories of foodstuffs, both perishables and non-perishables are virtually non-existent, at least from the viewpoint 
of the chain operators. The food chains themselves perform the stock-carrying function and the consumer makes 
purchases as needed from day to day. On the whole, food chain inventories turn very rapidly although the turnover 
rate of 15 to 25 times each year hides the fact that inventories of perishables such as meats and produce, turn much 
more rapidly than do the seasona! packs of canned fruits and vegetables, for example. It is the latter category and 
in other dry grocery ions where some simple statistical measurements have significance. While the level of inven- 
tories is ulti ly a function of sales, a controlling factor very frequently is a fixed inventory investment, expressed 
in the form of week's supply. Our investigations point to the conclusion that overall investment objectives of this 
form give rise to purchase limits and restrictions which might be inconsistent with what could be termed “economic 
quantities.” That is, a given investment objective could result in either overstocking or understocking in specific 
lines or items if a simple method of interrelating the variables of merchandise flow, buying cost and carrying cost 
is not available. Such a method is feasible and in practice yields significant cash savings and often results in reduced 
inventory investment. : 








Seasonality in the Consumer Price Index. H. E. Ruteyr, Division of Prices and Cost of Living, Bureau of Labor 

Statistics. 

The Bureau of Labor Statistics Consumer Price Index is subject to relatively little seasonal variation. This 
arises from the fact that several of the important components of the market basket, such as rent, property taxes, 
and transit fares, do not fluctuate in any periodic fashion, and there are broadly counterbalancing seasonal move- 
ments in other groups. The seasonal index for food reaches a peak in July, from a low point in February and March. 
On the other hand, apparel, fuels, house furnishings and automobiles tend to reach their highest points in the early 
and late months of the year, and are at low levels during the summer months. An important and as yet unresolved 
problem concerning the Consumer Price Index is that of adjusting the index for seasonal variations in consumption. 
The use of varying monthly weights provides a satisfactory index for year-to-year comparisons, but violates the 
essential principle of the fixed weight formula when used for month-to-month comparisons. A possible solution of 
this problem involves a combination of the traditional formula for yearly comparisons and an index for monthly 
comparisons derived by dividing a value index by a quantity index. 


New Statistics on Companies and on Concentration in Manufacturing from the 1954 Census. Irvine RotrenBena, 
Bureau of the Census. 


This paper describes briefly the recently published statistics from the 1954 Census of Manufactures on con- 
centration, presents some additional! data on concentration derived from the 1954 Census returns, makes some com- 
putations from the published figures on concentration, and describes the forthcoming data of the Census Bureau's 
enterprise-establishment tabulations. 

Some of the new information presented is as follows: 1. Herfindahl’s summary index of concentration has been 
computed for a sample of 48 industries. This index takes into account the size of each company individually. 2. 
Treating two years as a single time unit, four company concentration ratios have been computed for 20 industries. 
3. Company and plant concentration ratios have been computed for the 48 industry ple, in most inst, for 
20 companies and 20 plants. 4. Data on imports and concentration are shown for the 43 four-digit product groups 
where imports are 10 per cent or more of the shipments of domestic producers. 5. The 1947-1954 change in con- 
centration for 375 industries is shown with averages compiled by weighting the concentration ratio in each industry 
in four different ways: by 1947 employment, 1954 employment, 1947 value added rd manufacture, and 1954 value 
added. All four of these computations sh d slight i in trati about one percentage point. 








Sequential Distribution-Free Tolerance Regions. Sam C. SaunpERS, Boeing Airplane Company. 


The stochastic process X»=(Xi,..., Xn, ...) defined on the probability space (Qy, Aw, Pw) is assumed to 
be generated by countably many independent replications of a random variable X on (Q, A, P) when (Q, <) is a 
linearly ordered set and P is one of a class of measures that assign measure zero to the set of equivalence classes on 
Q induced by <. 

Let {An} be a sequence of non-empty sets such that 

1. An is a subset of the first (n+1) whole numbers such that the number of elements of \, = {1, 2,..., n+1} 

—. is always the same, say », for n >». 
2. The set valued function D defined by 


Din, tw) = sup {w E Q:2;" <w < 2;"}, 
iG, 


where 2;” is the jth ordered observation of (m, ..., Zn) determined by <, satisfying the requirement that 
D(n,-)CD(n-+1,-) for n=1, 2, 
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Let jan } be a sequence of non-negative integers such that a, =0 implies an41 =0. We define the stopping event 
Brion by 


Busan = (Zw E My:D(n, zy) = D(n + Gn, ty) }. 


Buia, nA Bisa; ifm =n +a, for some n > 9 


Sm = r. i= 


then {S,,} is a sequential sampling plan determined by a decision ruler =({An}, {an}). If {an} is such that limp,— © 
a,/n =0 then Py» {supm Sm} =1. Then Nis the random sample size defined by N (zy) =m iff % =Sm and the random 
set function D(N, Xj) is a tolerance region which is distribution-free in the sense that the random variable Q(N, Xy), 
called the coverage and defined by 


otherwise 


Q(n, te) = PfD(n, Zw)] iff tw E Sn, 


has a c.d.f. which is independent of the choice of P for any r. Several different procedures are easily obtained and 
studied (two of which have been discussed earlier in the literature by Wilks and Jarina) by the general method 
outlined. 


A Price Index for Custom Made Products, Luorp Savitie, Duke University 


Goods produced to the specification of the individual buyer present special problems in the construction of 
index numbers of prices. Because items are made to order, each new sale usually represents a distinct product and 
80, a new price. As a result of this, an individual establishment may quote thousands of different prices for thousands 
of identifiable products. And because there are frequently large bers of firms in industries producing parts to 
order, millions of separate prices are charged for millions of unique products. Under these conditions of profusion 
and change, a sample almost inevitably is both excessively small and conservatively biased. Thus the resulting index 
is frequently less than adequate. 

Some of these primary problems of scope and emphasis are overcome in the proposed index. By using a gener- 
alized version of the pricing formulas employed by an industry, average prices can be computed from reliable 
indexes of factor prices. Prices derived in this fashion may be a more accurate indication of the movements of average 
prices in certain industries than those achieved by traditional sampling techniques. 





APHA Housing Surveys and Rehabilitation Costs. ALpert H. Scuaar, University of California, Berkeley. 


Surveys based on the American Public Health Association’s method of appraising housing quality are widely 
used to guide local housing programs. With rehabilitation assuming a key role in many Urban Renewal projects, it 
is important that an APHA survey show its possibilities. This study tests the significance of the relationship be- 
tween APHA dwelling unit penalty points and rehabilitation costs. A correlation analysis of cost and point data 
for 47 structures located in Oakland, California, shows that costs and points are significantly interrelated and that 
a useful cost-estimating regression formula can be developed by compiling cost data for a small sample of structures 
drawn from those included in an APHA survey. The methodological technique of making such an analysis, however, 
requires a consideration of several factors: 1. A uniform definition of rehabilitation is necessary and should be most 
meaningful and realistic when applied to the structures and the area. 2. Since the APHA index does not specifically 
reflect relative costs, weighting of individual items in the index may be ry. 3. M ment problems are 
presented because structures typically vary in number of dwelling units and in size and because the penalty points 
refer to dwelling units while the cost estimates refer to structures. 





The Implications of the Changing Ownership of Federal Securities. Ina O. Scorr, Jn., University of Minnesota. 


During the postwar decade, the shares of the Federal debt held by commercial banks, mutual savings banks, 
and insurance ies have declined. Those of the U. 8S. Government investment accounts, state and local govern- 
ments, and miscellaneous investors have increased. Corporations, individuals, and the Federal Reserve banks have 
maintained about the same relative position. Unlike the other investor groups, the U. 8. Government investment 
accounts are, for the most part, restricted by law to Federal securities. Assuming no increase in the Federal debt, 
the U.S. Government investment accounts will present an inelastic demand for a fixed supply of Federal securities. 

Eight out of fifty-two trust funds and other accounts handled by the Treasury presently hold the bulk of 
Federal securities owned by these accounts. Five of these funds, The Federal Deposit Insurance Corporation, the 
Civil Service Retirement and Disability Fund, the Federal Old-Age and Survivors Insurance Trust Fund and 
Disability Insurance Trust Fund, the Railroad Retirement Account, and the Unemployment Trust Fund, will 
expand an estimated $3 billion, $6 billion, $35 billion, $2 billion, and $2 billion, respectively, or a total of $48 billion, 
by 1975. The projected expansion of the trust accounts will absorb over 60 per cent of the outstanding marketable 
debt. This development will greatly improve the Government bond market and facilitate the Treasury's task of 
lengthening the term structure of the debt. 


Drop-Outs in High-School in Large Cities. Davi Secen, Office of Education. 


A cooperative study of drop-outs in large cities—over 200,000 population—was carried out over the four 
years required from the entrance of the freshmen in high school to their graduation. One objective was to create a 
new method for pupil ting in dary schools and to explore its effectiveness in operation. A second objec- 
tive consisted of noting significant facts uncovered through this superior method of accounting and suggesting 
possible improvements in the school program. 

The objectives of the study were attained. The superiority of the accounting procedures over the older survival 
rate method was clear and several facets of the data were of considerable importance to school programs. Among 
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the facts which seemed iraportant were (1) school leaving was concentrated around the tenth grade and (2) a large 
number of students transferred among the schools within a city and also among cities. 

Certain suggestions for improvement within the schools seemed to grow out of the findings of the study. Among 

these suggestions were: 

1. The curriculum of the high school needs to offer curriculum sequences adapted to the characteristics of the 
students beginning with the 9th or 10th grade. The present curriculum seems better adapted to girls than 
to boys. 

2. There is need for a concentrated guidance attack at the 9th grade level in giving students a view of the 
educational and occupational world, and he!p in the analysis of each individual in relation to his potentiali- 
ties so that assuming the school offers curriculums suited to these potentialities he can be aided in choosing 
and entering them. Through this process more students will stay in school until graduation and become 
assets to our country as well as to themselves. 

3. The great amount of transfer between schools indicates that, at least in each city, the high schocls—except 
for special trade schools—should all offer similar curricular seq and subject offerings. This would 
alleviate for students the difficulty of adjusting to a new curriculum when transferring. It is possible that the 
confusion resulting in trying to make an adjustment to a new school may be one of the major factors in the 
school leaving problem. 


Russian Population Censuses: Methods and Concepts. Gatena ¥. Setecen and Micnast K. Roor. 


A new population census has been decreed for January 1959 iz the Soviet Union. Preliminary plans for the cen- 
2vs were drawn up by leading statisticians in various conferences during July 1957, and on August 1, 1957 these 
plans were subjected to a test run in 7 selected districts of the USSR. One of the first clues which indicated that the 
Soviet government was preparing for a new population census was the publication in 1953 of a 150-page manual 
concerning the last (1939) all-union census. The methods and pts employed in the new census will, of course, 
be greatly influenced by Russia and the Soviet Union's past experience in census-taking and many of the problems 
which will be encountered can be anticipated by a ideration of the leading of the past. 

The forthcoming census will be the fourth to cover the entire territory of the USSR (earlier comprehensive 
censuses were taken in 1920, 1926, and 1939), although there was one general census of ths inhabitants of the former 
Russian Empire, in 1897. During the last half century the methods and pts of these censuses were subject to 
considerable change, in part due to the progress of science but still more to exigencies of Soviet economic planning 
and propaganda objectives—the latter two dominant since the nineteen thirties. In the 1939 census as compared 
with the census of 1926, 4 new questions were introduced to improve the accuracy of the count, while 4 questions 
were dropped and the meaning of several questions changed by definition, in great part to make the census corre- 
spond to an explicitly stated notion that the census should be ducted in a which would support the 
regime’s political objectives. Further, the detailed results of the 1939 census, unlike the 1926 census published 
in 50-odd volumes, was considered a state secret, only bare summaries of findings being made public. There is no 
doubt as to the high level of accuracy on which the prospective 1959 census will be conducted and tabulated as re- 
gards the needs of the Soviet administration and planning agencies. However, it remains to be seen the extent to 
which the results of the census will be objectively reported or made public in fill detail, and the outlook at present in 
this respect is not promising. 


Seasonal Adjustments of Economic Indicators—A Progress Report. Jutius Suisxin, Bureau of the Census. 


This paper reports on the experience accumulated during the past year in a research program on ways of dealing 
with certain problems that arise in making seasonal adjustments of economic series. 

1. An analysis of a sample of about 150 economic series shows that for the post-World War II period seasonal 
movements dominate the month-to-month cyclical and irregular movements in economic series. Thus, the average 
month-to-month amplitude of the seasonal fluctuations exceeds that of the cycle-trend factor in 78 per cent of the 
series, that of the irregular factor in 65 per cent of the series, and the cycle-trend and irregular factors in combination 
in 45 per cent of the series. This means that it is usually easy to isolate the seasonal factor but not the cyclical 
factor. 

















2. A test of the Univac method of decomposition was made by taking irregular, cyclical, and seasonal com- 
ponents computed for real economic series, scrambling the components of different series, decomposing the synthetic 
aggregates by the electronic computer, and then comparing the components put into the synthetic series with those 
yielded in the Univac decomposition. While the results generally appear on the basis of inspection to be satisfactory, 
they also show that the magnitude of the largest of the three factors is always reduced, with offsetting increases in 
one or both of the other factors. Where two of the factors are large relative to the third, the smallest factor can be 
significantly affected. The number of national series in which the smallest factor will be significantly affected is, 
however, quite small, probably less than five per cent. The proportion of inclividual company or local area series so 
affected may be much larger. 

The synthetic series were also used to test the contribution of the second round of computations, based on the 
weighted fifteen-term moving average, in Method II. The results of these tests are clear: this iterative technique 
substantially improves the seasonal adjustments. 

3. A test of alternative confidence limits for identifying extreme seasonal-irregular ratios shows that the nar- 
rower the limits, the smoother the seasonal adjustment factor curves. We have not yet been «le to establish objec- 
tive criteria for determining the appropriate t of th to be d by this step. After a study of the 





charts and tables constructed for this test, we believe intuitively that it is desirable to shift from control limits of 
two to one and one-half standard errors, and we plan to do so soon. 

4. Seasonal patterns of aggregate series will ordinarily change because of changes in the relative importance of 
the components, and the impact of this factor on the seasonal pattern of total U. 8. exports is shown. No measures 
of the general importance of this factor are, however, given. 
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5. A test was made of alternative ways cf computing seasonal factors for the ends of series—leveling off or 
allowing for a continuation of the trend. The tests compared the results obtained in both methods when only 
partial data are available and when full data are available. Five series, with different mixtures of irregular, cyclical, 
and seasonal components, were used. Both the inter-war and the post-World War II periods were covered. The 
test showed that the Method II technique of allowing for a continuation of trend gives better results than leveling 
off at the fourth from end year, for which no estimating is involved. We made similar comparisons with the third 
from end year even though it does include estimated ratios, in Method II. The use of the factors for this year and 
the Method I'I technique gave estimates of about the same accuracy. Whether this is a statistical accident or points 
to a way of improving the Method II technique, I do not yet know. 

A test was also made of alternative ways of estimating the twelve seasonal factors for the year ahead. This 
test clearly supports the theoretical position that it is better to include a trend allowance in making such estimates. 

On the basis of these tests, similar tests of other techniques of allowing for trends at the ends of series have been 
scheduled, and a routine has been added to the Univac program estimating seasonal factors one year ahead with a 
trend allowance. 

6. The plans for future extensions and tests of this program are described briefly. The principal extensions we 
have in mind are: (1) confidence limits for judging the significance of month-to-month changes, (2) tests of sig- 
nificance for the I, C, and 8 measures, (3) computations of the I, C, and S measures for different stages of the busi- 
ness-cycle, (4) a measure of secular trend, and (5) a semi-log chart program. We also plan to make tests of (1) al- 
ternative weight patterns for computing the seasonal factors for the end years and for the year ahead, and (2) al- 
ternative curves for measuring the cycle-trend factor. 

7. The studies made throughout the year emphasize the importance of careful professional review of the 
electronic computer results, particularly at the ends of series. 


Measurement Problems in Collecting Supplementary Data about Deceased Persons. Monroe Strxen, Public 
Health Service. 


The information contained on the death certificate provides the basic tabulations of mortality statistics, but 
frequently there is a need for supplementary mortality statistics, particularly for epidemiological studies. The 
main objective of this paper is to compare the quality of data on the decedent’s maximum smoking rates obtained 
by mail survey and personal interview with close relatives. In a recent study, smoking histories for a sample of de- 
ceased persons in Pennsylvania were collected initially by a mail survey of relatives. Nonresponse interviews were 
conducted on a subsample of nonrespondents to the mail query, and quality check interviews were cgnducted on a 
subsample of respondents who returned the mail form. The quality check interviewer collected the data on smoking 
habits without reference to the results obtained by mail query which were posted at the end of the schedule. After 
the interview was completed, the interviewer reconciled differences between the reports obtained by mail query 
and personal interview with the assistance of the respondent. 

The quality of the data collected by mail and personal interview is evaluated from the viewpoint of nonresponse 
bias and response differences. The effects of two separate conditions on response differences in the reported maximum 
smoking rate are noted: (1) the effects of different respondents replying to the mail query and the personal interview, 
and (2) the effects of memory distortion because the decedent was not smoking at the maximum rate during the 
l-year period before death. 





Methods of Inquiry—The Consumption Approach. ELeanor M. Snyper and Joun G. Myers, Interdepartmental 
Committee on Low Incomes, New York State. 


Investigators concerned with describing the characteristics of the population with low economic status or with 
measuring the relative size of this group within a given population are hampered by the limitations of income size 
distributions which cover income received during the period of one year. In the absence of data on income received 
over a period of several years, for identical families, indirect methods of research must be employed. Two methods 
of distinguishing families with low economic status from those with temporarily low incomes are examined. The 
consumption expenditures of individual families are employed in both methods to identify those with “permanent” 
and those with “transitory” low incomes. The first method utilizes expenditure data on specific goods or categories 
of goods; the second method is based on a classification of families by total consumption expenditures. The data are 
drawn from a series of special tabulations of the 1950 Survey of Consumer Expenditures conducted by the U. 8. 
Bureau of Labor Statistics. The first procedure was developed in a study sponsored by the Franklin D. Roosevelt 
Foundation and conducted by E. M. Snyder. This study was the subject of a paper “A Method of Identifying 
Chronic Low Income Groups from Cross-Section Data,” presented to the March 1956 Conference on Research in 
Income and Wealth. The second procedure was suggested by William Vickery and given theoretical support by the 
recent work of Milton Friedman, Franco Modigliani, and Richard Brumberg. The City Workers Family Budget is 
used as a low income “yardstick.” The results of both procedures are compared, examining the number, proportion, 
and characteristics of families with low economic status found by each method. 


A Decision Model for Computer Utilization. Morris J. Sovomon, American Greetings Corporation. 


A general model is presented which enables a programmer to choose a procedure with the least cost, given 
the probability of failure, the costs of failure, and the cost of measures to control or ameliorate the failure. Two 
decisions which are facilitated by this model are: 1. Whether a machine produced tape should be read back before 
proceeding, and 2. The desirable length of machine running between recovery intervals. 


On the Logic of Diffusion Indexes. ALsznt T. Sommens, National Industrial Conference Board. 


This paper explores the logic of diffusion indexes in terms of the systematic uncertainties introduced in the 
course of the transfer of diffusion analysis from structural studies of the cycle to mechanical forecasting. 
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Diffusion indexes take their materials from structural studies of diffusion, but focus upon timing relationships 
among specific economic series. In this sense, they are an “indicator” approach to forecasting, distinguishable from 
such approaches popular in the past by greater sophistication in the testing of indicators, and by the multiple use of 
indicators, which permits gradual changes in probabilities, and opens the attractive (but partly illusory) possibility 
that irregularities in the behavior of indicators will tend to offset. 

The insufficiencies of evidence provided by a diffusion index arise for three classes of reasons. I. Difficulties 
arise with respect to interpreting the movement of individual series. Even series with good conformity character- 
istics when viewed historically may contract, month to month, almost as many times as they expand durlng the 
expansion phase and vice versa. Smoothing devices designed to reduce the irregularities tend to erode the timing 
lead, and may distort the cyclical timing itself. In addition, the presence of secular trends, and their varying rate 
from cycle to cycle, raise uncertainties about the location of cyclical turning points or, to put it the other way around, 
introduce uncertainties into the implications of a turning point once it is determined. Reduction of irregular and 
secular components in movements can be achieved only by examining the series partly retrospectively (as in the 
form of centered moving averages) and/or by moving from binary decisions about turning points to probability 
weightings (as in duration-of-run procedures). 

II. Uncertainty enters into the interpretation of the index b of its fixed tent, fixed-weight composi- 
tion. The widely recognized “uniqueness” of each cycle is more characteristic of peaks and troughs, with which dif- 
fusion indexes are concerned, than of the continuous phases of expansion and contraction that lie between them. 
The “content”—as distinct from the “scope”—of turning points varies from cycle to cycle; the change in “scope” 
recorded in the index’s fixed range will vary depending upon whether the initial changes appear fully within, or 
only partly within, or not at all within its range. 

III. Efforts to achieve very broad coverage raise another set of uncertainties. The criteria by which individual 
indicators are judged for inclusion in diffusion indexes—degree of lead, reliability of timing, regularity of intra- 
phase movement—appear to be inversely correlated in the performance of individual series. Performance of the 
collective index against one criterion can thus be improved only at some cost, in the form of reduclng its per- 
formance against one or more of the other criteria. Finally, coverage of the collective index is debarred from in- 
cluding series whose timing characteristics are found to be “irregular,” (since the turning point in such a series, 
assuming it can be identified, carries no systematic implication for the general cycle). Irregular timing is found among 
those activities which are not controllable over the course of the cycle or which do not conform because their motiva- 
tion is not related to the cycle. The same reasons that forestall the use of this series in diffusion measures act to reduce 
uncertainties in more subjective forecasting procedures. 

a 





Cyclical Behavior of Manufactures’ Inventories, 1945-1955. T. M. Sranpack, Jz., New York University. 


This study is principally an analysis of Department of Commerce monthly inventory data for industry groups 
along with 25 commodity series for finished goods stocks. Value data for 16 industry groups have been deflated 
since 1947, nine of which were deflated separately for finished goods and purchased materials. 

Fluctuations in inventory investment are shown to have been a major force contributing to changes in gross 
national product during both post war recessions, with changes in manufacturers’ inventories playing the leading 
role among the several categories of stocks. The use of monthly and quarterly data permitted more precise statistical 
description of inventory behavior than was possible in studies of the prewar period. This fact was especially impor- 
tant in the observation of inventory investment series in which well-developed cyclical movements of rather brief 
duration oceurred. Typically, two such investment cycles occurred in our inventory series during each reference 
cycle, the second such investment cycle tending to conform closely to reference cycle turning points. Cycles in the 
durable manufacturers category showed both better conformity and greater amplitude than nondurables pointing 
up the former group as the more important source of inventory instability. Analyzed on a stage of fabrication basis 
inventory series showed strong cyclical patterns for purchased materials and for finished goods invest t. These 
patterns are described in detail and their implication for revision of cycle theory discussed. 





Ecology of Coronary Heart Disease: Statistical Implications. Morton Rosine and Atrrep M. Sretnman, M.D. 
Veterans Administration. 





The past two years have wit datr d surge of interest in the ecology of coronary heart disease. A 
conference on the epidemiology of arterosclerosis and hypertension, a symposium on the results observed in epidemi- 
ological investigations of coronary heart disease, and numerous papers on the diagnostic, therapeutic, preventative 
and etiologic aspects of this leading cause of death—all of these recent additions to the literature have whetted the 
appetite of both the medical and public health professions for more definitive data. The lay public is desirous of 
immediate programs for prevention. However, despite the suggestive and widely publicized association between 
diet and coronary heart disease, a control program does not appear on the immediate horizon. 

This paper considers the various ecologic factors currently believed to influence the pathogenesis of coronary 
heart disease. An appraisal is made of the methodological probl faced by the biometrician and epidemologist 
in verifying and quantifying these hypotheses as to etiology. 

Examination and assessment is made of the advantages and limitations of each of these methodological ap- 
proaches: (1) di graphic studies on data obtained from morbidity and mortality reports; (2) retrospective studies; 
and (3) prospective studies. 

Conclusions, or perhaps our recommendations, will suggest the following: 

(1) Studies of selected special groups believed to have an appreciable greater ptibility to coronary artery 
disease, e.g. diabetics; (2) Retrospective studies of hospital and clinic populations with and without coronary artery 
disease; (3) Intensive, short-range, community programs for morbidity reporting of each case of coronary artery 
disease with specified ecologic data. 
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How to Communicate Statistical Findings in the Field of International Markets to Management. J. WiLNER 
Sunpexson, Ford International Division, Ford Motor Company. 


Periodicity, methods and forms of presentations of economic and market materials are fully comparable with 
those used by domestic divisions of Ford Motor Company. Major differences with domestic reporting lies in avail- 
ability of international statistical and ic data as well as in comparability of material coming from multiple 
markets. 

International automotive, tractor, and parts and accessories data are relatively poor in quality, irregular, dis- 
organized, and delayed. In foreign manufacturing centers only Germany approaches comparability with U. S.; all 
others are much worse. Basic reason is that in most manufacturing countries neither government nor trade associa- 
tions collect and disseminate data as is done in the United States. Furthermore, in very few countries do leading 
manufacturers, exporters or importers informally exchange (within the trade) historical and current performance 
data 





Basically U. 8. data recognizes domestic manufacture and import sources and analyzes all stages and aspects 
of market developments. Wide variations in foreign markets result in data which cover production, exports, im- 
ports, wholesale deliveries, and registrations. Supply cycles are much longer than those of U. S. domestic market 
and reporting periods are different adding to complexity of problem of reducing world-wide sales results to compa- 
rable basis. 

Ford International Division's compromise has been to collect best available data from all sources and to reduce 
these to some unified picture. Best statistical market data represent production at manufactured sources, ship- 
ments, and wholesale deliveries by assembly plants, to dealers and distributors in export markets. Adjustments are 
made for transshipments wherever possible. 

Forecasting of market volumes is an important aspect due to time lags in implementation of supply, product, and 
facility programs. Practice has been to have staffs in large manufacturing sources develop market trend data and 
forecasts for their own domestic markets. Ford International, Manufacturing Assembly and Sales locations, co- 
operate in the development of forecasts for all other important markets. Short term estimates related to early budget- 
ing procedures. Long term forecasts organized around concept of Standard Volumes. Following formal acceptance 
of Standard Volumes these are used for profit, supply, preduct and facility and all other forms of planning. 

Ford International Division now concentrates collection of historical and current sales information and short 
term forecasting in Marketing Office. Economic and long term forecasting including Standard Volume studies 
are responsibilities of Operations Planning Department in Facilities and Operations Planning Office. 


Factors Underlying Productivity: Some Empirical Observations. Nestor E. Teruecxys, National Bureau of 
Economic Research, Inc. 


The purpose of the paper is to inquire into the possibility of finding statistically significant relationships between 
the available measures of productivity change and several explanatory factors. Productivity change is defined in 
terms of total factor productivity. The measures of productivity change are available for several manufacturing 
industries and for several periods. These unpublished data were made available by Dr. John W. Kendrick of the 
National Bureau of Economic Research. Matching measures were compiled for the following explanatory variables: 
rate of growth of the industry, amplitude and frequency of the specific cycle, capital-labor ratio, firm and plant size, 
intensity of research activity, prevalence of professional technical personnel, rate of entry of new firms, and index of 
concentration. On the basis of simple rank correlations there are indications that growth of the industry, its cyclical 
stability, and intensity of research may contribute to progress in productivity. Further statistical processing of the 
available material, particularly by multiple regressions, is planned. 


Some Estimates of Tolerance Limits from Censored Data. Eart A. Tuomas, Avco Research & Advanced Develop- 
ment. 


The problem of estimating statistical tolerance limits on the performance or reliability of equipment has re- 
ceived renewed emphasis in recent years. However for physicai, economical or accidental reasons data from the 
tails of the distribution are often missing or suspect. This situation results in data from a “censored sample.” Statisti- 
cal Tolerance Limits are defined as by Wilks, Censored and Truncated Samples are classified as by Hald, Gupta 
et al. The simple (in application) distribution free techniques of R. B. Murphy are described. The work of Epstein 
and Sobel and that of Deming and Votaw have been utilized to estimate tolerance limits from censored samples of 
two types of exponential distributions. The results of empirical experiments using data from the normal distribution 
are reported. It is apparent that combining the estimates from order statistics obtained by Sarhan and Greenberg 
with the values of Bowker yields workable results for sample sizes up to size twenty. Maximum likelihood estimates 
derived by Gupta can be combined satisfactorily with Bowker’s values for sample sizes of approximately 100 and 
greater. No adequate tables exist for intermediate size samples. 


Approximately Distribution-Free x + ks Type Tolerance Intervals. Joan E. Wausu, Lockheed Aircraft Corporation. 


This paper contains an investigation of distribution-free applications of #iks type tolerance intervals for a 
moderately general class of continuous populations. This class consists of the populations whose distribution can be 
adequately represented by a generalization of the first seven terms of its Edgeworth expansion. To first order terms 
with respect to sample sise n, the #+ ks; tolerance interval is found to depend only on s;. Here @ is computed from 
all n sample values but s; is based on a subsample whose size is of order n?/s, Using this property and dividing the 
sample into r =¢(n'/3) independent subsamples, of approximately the same size, the first order distribution of the 
#ik max (s:, ... , 8) tolerance interval is obtained. On the basis of its first order distribution, lower bounds are 
stated for upper probability points of the distribution of this tolerance interval. For sample sizes of 25 and larger, 
these bounds seem to be valid for a large subclass of the class of continuous populations initially considered. This 
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subclass appears capable of approximating most continuous populations of p it. By 
of k and n, # large number of different tolerance intervals can be obtained. For many cases of practical interest, the 
sample size required for a specified tolerance interval of this type is noticeably less than that for the corresponding 
distribution-free tolerance interval based on the largest and smallest sample values. 


Characteristics of Life Insurance Sales a..i Buyers Which Influence Persistency in Premium Payment. ALrrep G. 
Warrney, Life Insurance Agency M .nagement Association. 


This paper is a report of the development of methods for predicting life insurance premium-paying behavior. 
Since the termination of a life insurance policy represents a failure in a plan for security, life insurance companies 
are eager to find factors associated with persistency in premium payment which can be used in training their agents 
in the methods of identifying potentially persistent clients, The data for this study came from a sample of 24,817 
policies issued in May 1949. Two years after issue a check was made to determine whether or not each policy was 
still in force. An analysis was then made of the information about the policy and policyholder which is available at 
the time of sale and persistency rates were calculated for a list of variables. Further analysis of the relationship with 
persistency for the most predictive variables and of their interrelationships led to the development of a Persistency 
Rater. Copies of this Rater which now has exteasive use in the life insurance industry were given out at the time of 
presentation of the paper. 


Unemployment Effects of a Major Plant Shutdown in a Depressed Area. Ricnarp C. Wuwcock, University of 

Illinois. 

What happens when the dominant industrial plant shuts down in a labor surplus area? This situation was stud- 
ied in a southern Illinois community by the University of Illinois for the Bureau of Labor Statistics. The paper pre- 
sents some findings on the unemployment effects of the shutdown. 

Prior to the shutdown, the community had had chronic unemployment because of employment declines in coal! 
mining and agriculture. The shutdown plant bad been the dominant factory employer and its closing accentuated 
the unemployment problem. Although worker out-migration was extensive, c.n ‘tinued employment declines in the 
area and reluctance of workers to leave the community kept the unemployment ratw high. In the two years after the 
shutdown, the unemployment rate fell from 16.3 to 11.1 per cent. The labor force, however, became 13 per cent 
smaller. 

In the two years following the shutdown, the average worker from the closed plant experienced more than 
6 ployment, but workers without farm land, older workers, and those with less education had longer 
unemployment than others. At the time of the study, a third of the workers had jobs in other areas; 4 were un- 
employed, underemployed, or had left the labor force; and } had full-time jobs within the area. These and other 
data illustrate a very strong home-town attachment and help explain the persistence of chronic unemployment. 


Subject Matter Covered in Engineering Statistics at Virginia Polytechnic Institute. R. Lowert Wine, Hollins 

College. 

The two quarter course (about 60 lecture hours) is open to seniors and graduate students in engineering. No 
prior knowledge of statistics is assumed. The odject in offering such a course is to acquaint the student with basic 
statistical terms and concepts. It is hoped that on completing the course students will be able to apply elementary 
statistics and know when to cll on a professional! statistician for advice when faced with the more complicated 
statistical problems. 

Topics discussed include measures of central tendency, of dispersi population and sample, un- 
biasedness and efficiency, the normal distribution and the central limit theorem, statistical inference with emphasis 
on tests of hypotheses, analysis of variance methods applied to one-way and two-way classifications, design of 
experiments along with orthogonal comparisons and multiple comparisons tests, regression and correlation, one-way 
covariance analysis, goodness of fit tests, control charts and single sempling plans. Approximately half of the time 
is devoted to tests of hypotheses, analysis of variance, and regression analysis. 


A New Approach to the Estimation of Inventories. Ratra 8. Wooprurr, Bureau of the Census. 


It has been very difficult to obtain monthly estimates of retail inventories because most businesses do not take 
inventories monthly and it is an unreasonable request to ask them to count their complete inventories solely for 
statistical purposes. The proposal is to obtain estimates of total i tories in all retail establishments by drawing 
a random sample of establishments and within each sampled establishment drawing a small random sample of spe- 
cific commodities to be actually counted and priced. In this way the workload for any given establishment can be 
reduced to the point where either the re:pondent or an enumerator can usually make the requested counts in a mat- 
ter of minutes. This approach if successful will yield, in addition to estimates of total dollar volume of inventories of 
all dities, (1) estimates by type of commodity and (2) estimates of physical quantities either directly or in 
index form. This approach has the disadvantages (1) that the estimates are only as complete as the basic list of 
commodities used in the commodity sampling and (2) that additional variance die to the sampling of commodities 
within establishments is introduced. 

The proposed system has not yet Seen put into overall operation at the Bureau of the Census. However, 8 list 
of commodities used in retail has been developed for this purpose and experimentation on a limited scale with ob- 
taining actual counts of specific commodities has been undertaken. In addition, the principle has been used in a sur- 
vey of food stocks made by the Bureau of the Census for the Federal Civil Defense Agency. 























BOOK REVIEWS 


Nonparametric and Shortcut Statistics. Merle W. Tate and Richard C. Cleliand. Danville, 
Illinois: Interstate Printers and Publishers, Inc., 1957. Pp. ix, 171. $3.95. Paper. 


Wituram H. Krouskat, University of Chicago* 


His book is primarily an elementary exposition of applied nonparametric analysis 

for research workers in the social, biological, and medical sciences. The first third 
of the text proper is a general introduction to statistical inference. Some short-cut 
parametric techniques, mostly involving the use of sample ranges, are briefly dis- 
cussed. There are 28 pages of tables and graphs. Numerous problems with real data 
are set, and answers or discussions for them are collected at the end of the book. 
Printing appears to have been done directly from typescript. 

The book, in my opinion, does not adequately fulfill its purposes, and the world 
still awaits an acceptable introductory textbook for applied nonparametric analysis. 
There are several general reasons for this critical evaluation. First, the book shares 
with its competitor by Signey Siegel [8] an unfortunately high density of errors and 
ambiguities.' Second, examples are discussed in a formal, shallow manner so that the 
reader keeps asking “But what are you really trying to find out? What about the 
sampling method? What are possible biases? What are the alternative hypotheses?” 
Third, several important references are not listed, nor are their contents reflected in 
the text. 

On the positive side, the authors are to be congratulated for paying some attention 
to techniques other than tests of no-difference hypotheses, especially confidence in- 
tervals, and for discussing not only advantages but disadvantages of nonparametric 
methods. Unfortunately, they do not go nearly far enough in their discussion of 
confidence intervals, and their presentation of the limitations of nonparametric 
procedures is incomplete and sometimes in error. 

Book reviews in this and other statistical journals complain again and again of a 
high density of errors. Does this complaint stem merely from the querulous pedantry 
of overmeticulous reviewers? I think not, but rather than standards of accuracy 
and clarity for statistical texts are unhappily low, perhaps lower than those for texts 
in other fields. The only long-run remedy for the problem will be a refusal of the 
statistical public to recommend and buy inferior texts. 

It might be said that the responsibility for accuracy and clarity is that of the 
author. But I feel that, given the present complex relationship between publishers 
and the scientific community, a considerable responsibility also falls on the publisher. 
He is in a position to see that manuscripts are carefully read by independent com- 
petent referees, and to urge appropriate corrections upon the authors. Both in the 
case of Siegel’s text and the present one, such a procedure would have resulted in 
incomparably better books, and propagation of error would have been materially 
reduced. Good refereeing, to be sure, must be paid for, in time and energy on the 

* Several people have commented on a draft of this review, and their remarks have been helpful in revising the 
draft for greater clarity. The opinions expressed in the review are, of course, not necessarily shared by those who 
commented. Besides my colleagues at Chicago I should like to thank the following: J. Gilbert (Center for Advanced 
Study in the Behavioral Sciences), F. Mosteller (Harvard), I. R. Savage (Minnesota), and J. Stanley (Wisconsin). 
In addition, I wish to express my appreciation to M. W. Tate and R. C. Clelland for their gracious and useful 
tees book has been competently and carefully reviewed by 1. R. Savage in this Journal [7]. Many of 
Savage's criticisms of Siegel's book apply as well to the book by Tate and Clelland; and Savage lists alternative 
expository sources. 
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part of the referee, and in cash on the part of the publisher. But five or six hundred 
dollars paid for refereeing, especially when spread over sales of, say, 4000 copies is 
not too onerous. I would far rather pay $4.15 for a properly refereed book than $4.00 
for one that is heavy with troublesome slips, ambiguities, and downright errors. 


APPENDIX 


The remainder of this review lists some of the more egregious or otherwise note- 
worthy errors, ambiguities and omissions that I noted in reading the book, and discusses 
contents and tables, especially in comparison with Siegel’s book. 


P. vi. “. .. nonparametric test probabilities [P values] as large as about .20 tend to be associated with con- 
ventional rejection levels under more powerful tests.” This startling statement appears in the preface and I find no 
evidence given for it there or elsewhere. It might mean that, if we consider samples that have led to conventional 
rejection level P values with parametric tests, then we tend to find appreciably higher P values with nonparametric 
tests. Or it might mean that actions taken when a parametric test gives a P value of about .05 tend to be the same 
as those taken when a nonparametric test gives a P value of, say, .15. In either case, I question the statement, al- 
though I do not think anyone has studied either interpretation. Whatever the motivation, nonetheless, we should 
be pleased that the tables given by Tate and Clelland run up to .20 signifieance levels. 

P. 2. “Only a random ple can be idered unbiased and therefore representative ....” First, it is not 
made clear that it is the method of sampling that is random, unbiased, etc., not the particular sample at hand. 
Second, other methods of sampling may well result in unbiased and representative results, however those adjectives 
may reasonably be defined. For example, stratification and clustering are well-known devices of sampling that do 
not give random samples in the strict sense. This point is hinted at on p. 24 but not made clear. 

P. 3. “Sample data . . . are of two broad kinds, quantitative and qualitative. The former . . . can be arranged in 
order of size. Qualitative data do not allow such an arrangement.” Surely any book in part for social scientists should 
consider ordered categories for qualitative data. 

P. 7. *... [sample cumulative distribution] curves do not provide any very exact information about the 
data.” On the contrary, they provide, for random samples, all the information about the data. Further, two well- 
known nonparametric procedures, both discussed in this book, are directly based on the sample cumulatives. 

P. 9. “Inferences drawn from a sample median about the population median, although not as accurate as those 
drawn from a sample mean, are safer and more general in the sense that they do not rest on the assumption cf 
normality.” But a sample median may be quite irrelevant to the population mean, and when it is relevant, it may 
provide a more accurate estimator than the sample mean. 

P. 11. “It ie ti ient to epeak about [the] modal ... category [for qualitative data], but used 
this way the mode has no statistical meaning.” 

P. 15. *[Kendall’s r] is not as good as [Spearman's rg] in describing relationship for it [r] is not directly 
related to the product-moment coefficients.” Why should direct relationship to the product-moment coefficients be 
a criterion? 

P. 16. “+... is always smaller than rg except near unity or zero, where the two tend to agree.” It is true that 
when r or vz is near + 1, so is the other, but the remaining two parts of the statement are false or misleading as may 
be seen from the following example. Consider the paired rankings 

1 2 3 4 5 6 7 8 9 10 

a) 10 1 2 3 4 5 6 7 8 
for which rg =1/33 and r =13/45. Also, the quoted statement is ambiguous about whether absolute or algebraic 
values are meant. Correct inequalities between rg and r are given, for example, by Kendall [3]. 

P. 24. The introduction to hypothesis testing discusses a matched pair experiment without a word about 
randomizing treatments. 

P. 36. After some fine comments earlier about the importance of considering alternative hypotheses in framing 
a test, the test proposed here for a simple multinomial null hypothesis is obtained by including in the critical region 
sample points with lowest probabilities under the null hypothesis. 

P. 42. “When we have a sample statistic, we may use it to estimate the corresponding parameter. This is known 
as point estimation.” This seems to put the cart before the horse. The first problem in point estimation is to decide 
what population characteristics are of major interest. 

P. 44. and many other places. The reader gets an impression from this book that only for underlying normal 
distributions are parametric methods useful. 

P. 52. Concerning a x? test on a trinomial population, “The fact that we cannot state a definite alternative makes 
it impossible to determine the power of the test. Nor is it possible to estimate [sic] confidence intervals for the [popu- 
lation proportions] . . . .” But approximate power functions and confidence sets in this case are, if not well known, 
at least known to many. Power is discussed, for example, by P. B. Patnaik [6]. Cor fidence intervals for a single one 
of the trinomial or multinomial probabilities may be found just as in the binomial case. Approximate joint con- 
fidence ellipsoids may be obtained for all the probabilities by including in the confidence region, in standard fashion, 
all those sets of probabilities that would, as null hypotheses, be accepted by the sample at hand. 

P. 53, Example 21. This gives the post positions of the winning horses in 144 races, and the reader is asked 
whether it is reasonable to believe that post position doesn’t affect the chance of winning. The “correct answer” at 
the back of the book is to apply a standard x? test. Here is a perfect place to consider ordered alternatives, but not 
a word is said.‘It is of interest that Siegel discusses the same example [8, 44-6] and does mention the order question, 
although only in a footnote and without presenting any appropriate method. 














BOOK REVIEWS 597 


P. 56. Brief mention is made of application of the Wilcoxon signed rank test to a single ple, but no ry 
remark appears about the special nature of the test in this case. (When it is used to test or estimate a translation 
parameter, symmetry must be assumed.) 

P. 60. *... [a disadvantage] of the conventional 42 test for goodness of fit [is that] if the signs of the differ- 
ences are alike in several consecutive intervals of the distribution, the test may fail to detect significant departures 
from the assumed curve ... nothing can be done about [this] . . . except care in interpretation of the test.” The 
disadvantage is not clearly discribed and one can take signs of deviations into account. 

P. 63. In finding graphically the maximum vertical deviation between hypothesized and sample cumulatives it 
is suggested that probability paper be used that distorts the vertical scale so that the hypothesized cumulative is a 
straight line. This may make it rather hard to find the maximum vertical deviation. 

P. 74. In discussing Fisher's exact 2 X2 test (not identified by this name) no reference is made to the several 
available tables for it, nor is any reference made to Table J of the Tate-Clelland book itself which provides some 
approximate critical values for the 2 X2 situation. 

Chap. IV. In the discussion of chi-square tests, despite earlier statements of good intention, there is no discus- 
sion of power. 

Pp. 78-82. The discussion of nonparametric linear regression considers only the Brown-Mcod method and omits 
any mention of the Daniels method [2]. 

P. 84. “None [of the two sample nonparametric tests] lend themselves to quick or accurate estimation of con- 
fidence limits.” The Wilcoxon tests lead to quite easy graphical methods for obtaining (not estimating) confidence 
limits. Details are partly discussed by Moses in [5], a reference listed by Tate and Clelland. Th same point arises 
on p. 103. 

P. 8. “The connection between the contingency test of independence and the test of sample proportions arises 
from the fact that the data in two samples may be classified in the 2 X2 table.” This is a little like saying that the 
connection between adults and children is that both have four limbs. 

P. 90. “The [Wilcoxon two-sample } test is the nonparametric test most nearly analogous to the standard 
parametric ¢t test... . ” Surely, it is not as analogous as the permutation test on the observations using the ¢ statistic, 
nor as analogous as the Fisher-Yates normalized rank method. Further, any test based on permutations of a set of 
preassigned numbers, whether ranks, normal order statistics, or some other set, is at least as much an analog to the 
t test as Wilcoxon's test. A parallel comment may be made about the same remark in the paired two-sample case 
(p. 101). 

P. 90. “It [the Wilcoxon two-sample test } assumes neither normality nor homogeneity of variance.” It does, 
as usually presented, assume not only identity of variance but idertity of distribution under the null hypothesis. 
To be sure, it may not be so very sensitive to some aspects of this assumption, but this is another matter. 

P. 100. Of the sign test applied to the so-called correlated 2 X2 situation: “It would be very difficult to study 
the power of the test and, practically speaking, idle to do so.” I disagree heartily on both counts. 

P. 103. The “correction for ties” in the case of the Wilcoxon paired sample case is not given. 

P. 105. The Fisher-Yates normalized rank procedure is mentioned in passing, but without any description. 
There is confusion between power against normal alternatives and others. 

P. 115 and following. This discussion of the Brown-Mood median tests for two-way analyses is based wholly on 
Mood's textbook and fails to mention an important subsequent article by Brown and Mood [1]. 

P. 123. The reference to Kendall's monograph on rank correlation is only to its first edition in 1948, without 
mention of the more recent and expanded edition in 1955 (3). 

Comparative Contents. Tate and Clelland discuss two general a:eas that are not discussed by Siegel. First, non- 
parametric confidence intervals, and, second, short-cut normal theory methods. Two specific topics in the Tate and 
Clelland book and not in the Siegel book are (1) the Tukey-Olmsted corner test of association, and (2) the Brown- 
Mood analogs, based on counts above and below medians, to linear hypothesis problems. Siegel considers the 
Moses extreme reaction test, while Tate and Clelland do not. Siegel discusses permutation tests on the observations 
in more detail than do Tate and Clelland. Neither book discusses runs of kinds other than above and below the 
median and the Wald-Wolfowitz two-sample runs test. Neither book discusses in any detail problems and approxi- 
mations connected with the Poisson distribution. Both books are weak in their discussions of power, and both 
omit discussions of design problems. 

Tables. The tables and graphs given by Tate and Clelland are much the same as those given by Siegel, except 
that Tate and Clelland in general give critical values for a larger number of significance levels, typically 1%, 5%, 
10%, and 20%. This is a distinct advantage, since one usually wants to know at least an approximate P-value, not 
just an accept-reject dichotomous decision. Also, Tate and Clelland include the Link-Wallace tables for short-cut 
testing and multiple comparisons in the balanced one- and two-way analysis of variance cases, along with Mosteller's 
graph for short-cut estimation of a normal correlation coefficient. 

A first glance at many of the tables shows scattered blank spaces. that may puzzle some readers. For example, 
in Table E, two-sided .ritical values are given for testing a null hypothesis on a binomial proportion in terms of 
number of observed “s.sccesses.” For sample size 25 and null hypothesis proportion .5, the table reads as follows: 





Significance level Critical values 
-20 8,17 
10 
05 
01 


Why the blank for significance level .10? The answer is, of course, that because of discreteness, if one wants a sig- 
nificance level <.10, one actually obtains a level <.05 and this might as well be recognized. However, the point 
should be, and is not, made clear for the intended reader of the book. 
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Table D (confidence intervals for 50%, 75%, and 90% quantiles) and Table E (explained above) overlap in 
large measure, but they are not everywhere consistent. For example when sample size is 50 and significance level .2, 
we may compare the first column of D with the last column of E. Both cannot be correct. (In the last column of E, 
with N=40, a=,.05, and ¢=.5, “17” is a misprint for “27.”) 

In fact four of the tables (D, E, N, and ©) all give in effect critical values for the sign test, but this extensive 
overlap seems to be mentioned only once (p. 35) very briefly. A superficial check turns up discrepancies beyond the 
one mentioned above. For example, with sample size 30 and two-tail significance level .2, Table E tells us that 11 is 
the critical number of successes in the left tail, while Table O shows a blank implying that 10 is critical. (The actual 
two-tail P-values are .2005 and .0987 respectively.) Again, for sample size 50 and two-tail significance level .2, 
Tabie O tells us that 20 is the critical number of successes at the left, while Table N gives 19 as the corresponding 
critical value. (This is for the same situation as in the above paragraph; Table E gives 20 and Table D gives 19. 
The actual two-tail P-values are .2026 and .1189 respectively.) 

This is an appropriate place to interject a plea that tables of critical values for discrete statistics include 
P-values. The extra room needed is often a small price to pay for the added information. 

Tate and Clelland give a compact table of approximate critical values for the 2 X2 contingency table test cover- 
ing a wide range of sample sizes. Siegel gives a much more detailed table for small sample sizes and no table for 
larger ones. 

Tate and Clelland give two-tail critical values for the Wilcoxon signed rank test with sample sizes 4(1)30 and 
significance levels .01, .05, .10, and .20. Some of the values for level .20 appear in fact to correspond to P-values 
slightly greater than .20. 

Table Q, giving critical values for the H test, has many errors, mostly small, but a few of consequence. Some of 
these stem from taking, say, a value of H as critical at the .01 level when the actual P-value is .011. Others appear 
to come from lack of reference to the correction note published by Wallis and me as a supplement to our paper 
presenting the A test [4]. One or two appear to be typographical errors. Siegel's table for the H test is much more 
complete and I have noted no errors in it. 

Tate and Clelland give clearly drawn charts of the Clopper-Pearson kind for tests and confidence intervals on 
binomial proportions. Significance levels are .01, .05, .10, and .20 (two sided). 
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The Mathematical Theory of Sampling. Walter A. Hendricks. New Brunswick, N. J.: The 
Scarecrow Press, 1956. $7.50. 


T. A. Bancrort, Iowa State College 


HE author states that this book is addressed to students who wish to become 

familiar with formulas and computations incident to sample design and the 
analysis of sample data. He states further that although major emphasis is given to 
sample surveys, most of the methods have a broader field of application. The book 
takes up applications largely from the viewpoint of agricultural surveys “because that 
happens to be the author’s field of activity.” 

Chap. I presents an introduction to basic concepts in sampling and discusses the 
role of statistical methods in sampling. Chap. II deals with the case of random sam- 
pling with equal probabilities of selection for infinite and finite universes. Expected 
values, averages, and standard errors are presented. Chap. III takes up probability 
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theory and its relation to sampling including a discussion of Boyes’ Theorem, binomial 
variance and approximate confidence limits, sampling without replacement from a 
finite universe, and variances and covariances in the binomial and multinomial dis- 
tributions. 

Chap. IV covers random sampling in a universe defined by a continuous frequency 
distribution with emphasis on derived sampling distributions based on a normal 
parent population distribution. 

Stratified sampling is the subject of Chap V which includes a discussion of optimum 
sample allocation, the relationshiop of costs to sample allocation, and the effects of 
errors in stratum weights. 

The author takes time out in Chap. VI to present a rather detailed discussion of 
the analysis of variance including: equal and unequal numbers per stratum, data 
grouped in subclasses, two-dimensional stratification of data, two-way classifications 
with unequal cell frequencies, and studies of attributes. 

The remaining chapters treat the usual topics in survey sampling: Chap. VII 
Multistage Sampling, Chap. VIII Sampling Units and Their Selection, Chap. IX 
Sampling with Unequal Probabilities of Selection, Chap. X Ratio and Regression 
Estimates, and Chap. XI Control of Errors in Sample Surveys. 

Numerous numerical and theoretical examples are given which should prove help- 
ful in understanding the derivations and exposition; however, no exercises are in- 
cluded. References given at the end of the chapters provide a fairly satisfactory 
bibliography in survey sampling, and also in general statistical methods and theory 
up to the intermediate level. The one exception voted is the omission of any reference 
to Sampling Theory of Surveys with Applications by P. V. Sukhatme. 

The author has combined a theoretical approach involving some mathematical 
literacy with numerous simple numerical examples to illustrate the ideas and to in- 
dicate the nature of applications. Also considerable space is devoted to topics in gen- 
eral statistical methods and theory which other texts in survey sampling assume as 
prerequisities. As a consequence the book is hardly suitable as a beginning methods 
course in survey sampling of the type of Volume I of Sample Survey Methods and 
Theory by M. H. Hansen, W. N. Hurwitz, and W. G. Madow. On the other hand, 
no attempt is made to present a complete formal mathematical treatment and hence, 
in this sense, it cannot be classed as an advanced text. Nevertheless, this reviewer rec- 
ommends this book to students of survey sampling who have some knowledge of 
mathematics through calculus and who wish a self-contained treatment of the subject 
including related topics in general statistics at the intermediate level. 


Extension of Existing Tables of Attributes Sampling Plans. J. M. Wiesen. Albuquerque, 
New Mexico: Sandia Corporation, 1956. Available as “Physics (TID-4500, 13th Edition),” 
Office of Technical Services, Department of Commerce, Washington 25, D. C. Pp. iii, 236. 
$1.25. Multilith. Paper. 


Davin R. Howes, Chemical Corps Engineering Command 


His document presents 230 curves for each of the acceptance numbers 0, 1, and 2, 

for sampling plans with lot sizes of 100 to 500 by steps of 50 and from 500 to 2000 

by steps of 100. Sample sizes are up to one half of the lot by an irregular rule starting 

at 25 for a lot of 100, and 200 for a lot of 2000. Since the sample sizes are large with 

respect to the lot size, hypergeometric probabilities are used. The charts can be read 

to an accuracy of .05 per cent in per cent defective and to .2 per cent in probability 
of acceptance. 

The report is designed to provide operating characteristics for single sampling 

plans of very high discrimination for use where sequential or variables sampling is 

impractical. It may also be useful as a source of hypergeometric probabilities as an 





600 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1958 


alternative to the use of the approximations given by Dodge and Romig as formula 
(1’), p. 43, in Sampling Inspection Tables (Wiley, 1944). For practical purposes, it 
would be helpful to have, for given sample sizes, charts showing selected percentage 
points of the hypergeometric distribution as a function of lot size. 


Tables of the Normal Probability Integral. D. B. Owen. Albuquerque, New Mexico: 
Sandia Corporation, 1957. Available from the Office of Technical Services, Department of 
Commerce, Washington 25, D. C. Pp. 58, $0.40. Paper. 


Hans O. Bremer, National Bureau of Standards 


7 publication was prepared primarily as a working paper for use within the 
Sandia Corporation. It contains tables of the normal probability integral and 
Mill’s ratio. 

The form of the normal probability integral that was tabulated is 


G(h) = (1//2z) f. ens"de 


and is tabulated for +h=0(0.001)4(0.01)7 with the function being given to eight 
decimal places. For extreme values of h, the normal probability integral G(—h) and 
Mill’s ratio are given for +h =50(1)150(5)500 to eight significant places. 

These represent an expansion of previous tables given by E. 8. Pearson and H. O. 
Hartley in Biometrika Tables for Statisticians, Vol. I, Cambridge University Press, 
London, 1954, pp. 104-11. 


Metzger’s Table of 10,000 Square Roots. William T. Metzger. Indianapolis, Indiana: Re- 
search Tabulating Bureau, 1958. Pp. 21. $2.50. Paper. 


bee table shows each number from 1 to 10,000 and its square. The square root of 
a given number is obtained by finding the location of the given number in a column 
of squares, and then noting what two four-digit numbers produce squares on either 
side of the given number. 

The table is reproduced from tabulating machine print-outs in rather small type. 
It consists of 100 columns of 100 entries each. 


W. A. W. 


Metzger’s Table of 45,000 Per Cents. William T. Metzger. Indianapolis 6, Indiana: Re- 
search Tabulating Bureau, P. O. Box 7, Dept. 434, 1957. $4.00. Paper. 


ENOMINATORS from 1 to 300 are represented by columns and numerators from 1 
to the value of the denominator are represented by rows. The entries then give 
the percentages to five decimals. 
The table is reproduced from tabulating machine print-outs in rather small type. 
W. A. W. 


Models of Man. Herbert A. Simon. New York: John Wiley and Sons, 1957. $5.00. 
May Bropseck, University of Minnesota 


HE descriptive phrase “Mathematical essays on rational human behavior” is 
helpfully appended to the title of this book. More specifically, these papers on 
diverse topics in social science, exclusive of economics, are presented as a contribution 
to quantified theory of the social psychology of groups and organizations. Their 
unity, as the author conceives it, lies in how much they contribute toward “laying 
foundations” for a science of man as a social and rational animal. Whether or not 
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they actually have this unity is doubtless just another instance or facet of the aggre- 
gation problem. My own reaction to the aggregation is ambivalent. Since Simon’s 
motives are admirable and his ingenuity is considerable, this qualified enthusiasm 
requires some explanation. 

The essays are divided into four sections. Each section has a preface in which Simon 
discusses the connections among the parts and elucidates some of the special problems 
raised. Many of these remarks are, as one would expect, wise and illuminating com- 
ments on problems of measurement and quantification in social science. Here indeed 
is one source of the reviewer’s ambivalence. One would as soon censure Motherhood 
as the ideals Simon professes. Scientific terms must be operationally defined. Of 
course. Quantification is highly desirable—and not just because physicists do it. Of 
course. A truly unified science of human behsvior is devoutly to be wished for. 
Indeed. Nor, given the present state of social science, outside of economics and cer- 
tain branches of theoretical psychology, is it superfluous to rehearse these maxims. 
Indeed not. Yet, two things trouble me. The first is Simon’s misplaced preoccupation 
with the analysis of causality. It is misplaced because, among other things, he quite 
incorrectly, it seems to this reviewer, believes such an analysis to be necessary for a 
proper operational definition of “power” or “influence” in the social or political sense. 
The second qualification is the “level” of theory at which Simon constructs his major 
“unifying” model. I shall discuss these one at a time. 

The first section is on “Causation and influence processes.” The very concatena- 
tion of these two notions is odd. Causality is a methodological notion, while influence 
between persons, as in “Smith influences Jones,” is a natural or descriptive relation 
like “loves” or “taller than.” In the first paper, “Causal ordering and identifiability,” 
Simon develops a method for establishing a causal order among the variables of a 
theory and a corresponding order of “precedence” among the equations containing 
these variables. Causa] imputation is given, according to Simon, “operational” 
meaning by associating with each equation a procedure for altering its constant terms 
or coefficients. Econometricians will doubtless be interested in the parallelism Simon 
draws between the conditions for establishing a causal order and those for the iden- 
tifiability of parameters. However, “operational definition” is a good thing, all right, 
but only where appropriate, namely, to descriptive or empirical concepts or variables, 
particularly those involving measurement. “Causality” is not a descriptive concept, 
that is, a name for a kind of observable thing, event, or attribute cf or relation be- 
tween things or events. The problem of analyzing ‘‘causality” is thus not a problem 
of proper concept formation nor, tpso facto, of “operational definition.” It is, rather, 
part of the problem of knowing when we have a true empirical law or correlation and 
when we have an accidental conjunction or a “spurious” correlation. On this problem, 
Simon has useful things to say in a separate essay, about the assumptions scientists 
make when distinguishing true from spurious correlations. These assumptions imply 
the need for a context of knowledge, that is, a theory about the irrelevance of certain 
variables or the lack of correlation among random disturbances associated with given 
relevant variables. They do not imply or require an “operational” definition of 
causality, for there is no such thing. There is no such thing because “causality” is 
not a variable of the law or theory we are testing. Simon knows all this, of course, so 
he speaks instead of the “operational definition of an equation.” By this he means 
associating with the equation a procedure for manipulating the coefficients. Un- 
fortunately, I do not know what it means to define an equation, operationally or 
otherwise. For one thing, a definition is unequivocal, while the ways variables can 
be manipulated may be legion. I do know that this over-extension of “operational 
definition” to places where it is not applicable is more confusing than clarifying. 
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One consequence of this confusion is Simon’s belief that the definition of influence 
requires a prior definition of causality. In the same section is reprinted a paper on 
the “Observation and measurement of political power.” Since “power” or “in- 
fluence,” which he uses synonymously, is an asymmetrical relation between persons, 
Simon fears that, like “cause,” it may be a suspect notion. But, unlike “cause,” “in- 
fluence” is a descriptive relation, hence it can be measured, correlated with other 
things, and so on. It is really not necessary to point, as he does, te such recondite 
things as electrical network theory in order to justify the use of asymmetric relations 
within science. Is “harder than” something to be “wary” of? The characterization of 
power or influence no more depends upon a prior analysis of “cause” than does the 
definition of “harder than.” An operational definition of “Jones influences Smith” or 
“Jones’ behavior causes Smith’s behavior’ might be “If Jones talks, then Smith 
jumps,” just as “harder than” is defined in terms of scratching. No theory of causality 
nor even of the difference between a true and a spurious correlation is needed for an 
operational definition. Needless to say, Simon’s practice is better than his theory, as 
is shown by his subtle discussion of the problems in constructing a useful definition 
of “influence” and in the choice of units of observation and measurement. 

Section II, “Social processes,” deals with man as a social animal. These models are 
“mathematical” or, as I prefer to say, quantitative reformulations of nonquantitative 
theories of the behavior of individuals in groups—their communication processes, 
the stability of the group, ete.—as developed by the social psychologists Homans and 
Festinger. Such “translations” are not only, as Simon points out, clearer and more 
rigorous, they also say a great deal more than the original. This is both a virtue and a 
weakness. The virtue is obvious. The weakness is that by the nature of the mathema- 
tical machinery and the data, the quantified theory will generally, even if not in these 
particular cases, say more about fewer variables. This is partly because of the number 
of equations that can be handled at one time and partly because many of the relevant 
variables resist quantification. A nonquantified theory, on the other hand, may say 
more, though less precisely, by stating the connections among a greater number of 
variables. One may therefore purchase the virtues of quantification at the price of 
theoretical triviality. This need not happen, of course, as theoretical economics shows. 
But I shall revert to this case in a moment. In contrast with the models of “social” 
man as an adaptive being who engages in group activity for its own sake, the two 
papers in Section III, headed “Motivation: inducements and contributions,” present 
models of man as an econemic maximimizing animal, for whom organizations are 
purely instrumental to exterral rewards. The concepts of classical economic theory 
are here applied to theories of organization and of the employment relation. The 
final, fourth section, on “Rationality and administrative decision making” is in 
many ways the most interesting part of the book. It attempts to unify in a new 
theory certain features of social and rational man. Simon here argues that a “more 
realistic” definition of rational behavior is needed, one which takes account of the 
empirical limits, within the individual and in his environment, on human rationality. 
Economic, maximizing, omniscient man gives way to “satisficing” man of limited 
knowledge who settles for a course of action that is “good enough” rather than the 
best of all possible alternatives—which in any case he never knows. Simon’s definition 
of “approximate” or “intended” rationality incorporates social and psychological 
variables omitted from both classical economics and more recent game and statistical- 
decision theory. These variables, drawn from “common experience,” help provide a 
definition of rational choice which is more consonant with how people “actually” do 
behave. 

Quite apart from any difficulties in recent economic theory, Simon’s attempt to 
replace the unrealistic “ideal type” of economic man by something more true to life 
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seems inherently desirable, on the face of it. Yet, I am less persuaded of the virtues 
of this project than perhaps a good empiricist should be. Thus far, the most fruitful 
theory-construction in social science has been either of two kinds, At one extreme, 
as in learning theory in psychology, the behaviors studied are of the rudimentary kind 
exhibited by, say, the white rat. (This is why Simon must rely on conjecture for the 
variables in “actual” human rational behavior.) Quantification is possible, but pre- 
diction of complex, human behavior waits upon the development of composition 
laws, that is, laws stating how the laws of the elementary processes combine to result 
in complex human behavior, e.g., so-called “insight” learning in man may be ex- 
plained in terms of composition laws based upon the laws of rudimentary trial-and- 
error learning. If, as there is reason to believe, human behavior consists of complex 
patterns of elementary processes, the potential scope of such theories is very great. 
At the other extreme, in theoretical economics, highly complex and sophisticated 
aggregate human behavior is explained by means of assumptions about rational be- 
havior which, for any individual, are most likely false. Yet, for macrophenomena, 
they work rather well. Unlike either learning or economic theorists, Simon at- 
tempts to construct directly a theory for individual choice behavior. This, at the 
level of social psychology, is neither very basic nor very macroscopic. Simon has the 
advantage over most social psychologists of mathematical ingenuity and understand- 
ing of what has already been done in economics. Whether the probable lack in either 
actual or potential scope of theory which is neither fundamental nor highly aggrega- 
tive can be compensated for by these advantages remains to be seen. 


The Demand and Supply of Scientific Personnel. David M. Blank and George J. Stigler. 
New York: National Bureau of Economic Research, 1957. Pp. xix, 200. $4.00. 


M. BRoNFENBRENNER, Michigan State University 


HIs is the most recent and should prove one of the most controversial of the Na- 

tional Bureau studies in service industry labor markets. The series is a monument 
to one of the present authors, George J. Stigler, who has been associated prominently 
with every volume. 

Unfriendly critics have labelled the National Bureau a “figure factory” in the tradi- 
tion of “Knowledge for what?” Such charges are definitely out of place with respect 
to these service industry labor market studies, Chapter and verse defy detailed cita- 
tion, but the studies all stress the beneficent operation of the free market in the long 
run, adapting the quantity supplied to changes in demand, or vice versa, via the 
price mechanism. 

The same invisible hand is equally visible in the present unusually timely study. 
Under the general head of “scientific personnel” major emphasis is placed on the 
market for the services of engineers, who dominate the category quantitatively and 
for whom the most and best data are available. Secondary stress is on chemists. 
Mathematicians and physicists are dealt with together in Chap. 5, and there are a 
number of useful but scattered references to the problems of “earth scientists” (geol- 
ogists and geographers). Mathematical statisticians are not considered separately 
from other mathematicians, and other statisticians are excluded entirely. Despite 
the sharp increase in employment opportunities since 1939 in all the fields covered, 
supply has kept pace, with possible dilution of quality, at prices rising proportionately 
only half as much as the wages of manufacturing labor. When shortages were most 
acute (1946 and 1950) the incomes of scientific personnel were lagging furthest be- 
hind, and reduction of the lag alleviated the shortages. 

It may be informative to reproduce some of the authors’ indices, in view of the 
widespread belief that incomes of scientists have risen inordinately. Table 11 (p. 25) 
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compares indices of median engineering salaries to average wages, salaries, or net 
incomes in a number of other occupations. Taking the 1929 value of each ratio of 
median engineering salary to the corresponding figure for another occupation as 100, 
the most recent values computed by Blank and Stigler are: to all full-time wage and 
salaried employees, 67.9 (1954), to full-time manufacturing wage earners, 66.6 
(1954), to physicians, 59.5 (1951), and to dentists, 83.5 (1951). Only the ratios to 
lawyers (96.0—1954) and to college teachers (96.3—1953) show the engineers holding 
their own. If 1939 instead of 1929 had been chosen as a base year, all the comparisons 
would have been less favorable to the engineers, although in varying degrees. 

A statistically interesting feature of Blank and Stigler’s analysis explores the 
tendency for scientific personnel to appear when needed “out of the woodwork,” 
meaning that increases in their number are more rapid than can be accounted for by 
the net output of accredited training institutions. Anyone who has kept his ears open 
around, say, an aircraft or electronics plant knows what has happened, namely up- 
grading, particularly the application of the “engineer” title to intelligent technicians 
trained on the job. But this reviewer at least did not realize the number of such 
cases—apparently some 40 per cent of the engineering profession (p. 86). 

The implicit advice to trust the market, keep one’s price system dry, and resolve 
all shortages by income increases, is not currently popular as regards scientific per- 
sonnel, thanks to recent Soviet advances (ICBM, Sputnik). One cannot, however, 
criticize this volume’s pre-Sputnik complacency as wrong; at the very worst, it may 
be partially irrelevant. It shows how the market equates the supply of scientists to 
the demand. The arguments of the alarmists are that the supply and the demand are 
both too low to keep pace with the Soviet Union, and that the price system allocates 
the supply wrongly (too many engineers in gadgetry and salesmanship, too few in 
research and teaching). Blank and Stigler’s careful statistics provide no clues for 
evaluation of these assertions. But neither does any alternative body of scientific 
analysis to supplement sheer intuition. 


New Technology and the Supply of Petroleum: the Treatment of Uncertainty in Resource 
Planning. Miller B. Spangler. Chicago: University of Chicago Press, 1957. Pp. xiii, 299. 
$3.50. (Obtainable from the author, R.F.D. 1, Yorktown Heights, New York.) 


Joun 8. McGepr, The University of Chicago 


ucu has been said about the uncertainties that enshroud crude oil discovery and 

production. The first successful oil well in the United States was drilled in 
1859, and periodically ever since controversy has developed over the volume and 
adequacy of domestic oil reserves. Many predictions have been made. Those that 
could be tested have generally proved to be grossly inaccurate. 

The purpose of this volume is to develop better methods for predicting the dis- 
covery and development of aggregate crude oil reserves in the United States. Orthodox 
methods, including projection of time series, have not been notably successful. What 
is wanted, according to the author, is a new technique for dealing with complex un- 
certainties. The solution is to rely upon expert opinion. Experts can prepare subjec- 
tive probability estimates for each expected technological innovation, together with 
estimates of its impact upon domestic oil reserves. Nontechnological factors are 
taken to be constant. 

Underlying this approach is, among other things, a specia! conception of innovation 
itself. These are the basic tenets: (1) Innovation is accelerating. (2) Specific innova- 
tions are more or less predictable, since there are “orderly steps proceeding from 
fundamental research ... to developmental research” (p. 7). (3) Because laws of 
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technological development are analogous to biological laws, experts can determine 
the relative maturity of specific techniques and processes, and how much additional 
development is possible in various fields. (4) It is possible to estimate what kind, as 
well as how much, innovating there remains to be done, and what effects it will have.! 

By many standards, the United States is a relatively mature oil province, and the 
physical task imposed upon discovery and development has grown harder. Never- 
theless, technology has largely filled the gap. As a consequence, the real price of 
domestic crude oil has remained “rather stable for the past several decades.” (p. 
44). The outlook is for continuing technological progress, since the industry is increas- 
ing what are already large research expenditures. 

The basic question posed by this study is, in a sense, whether there will be enough 
innovation. Lacking the probability estimates of a corps of experts, the author hints 
at his own. He distinguishes three main areas for future technological development: 
discovery, field development, and recovery. For each he discusses, in considerable 
detail, the most significant past innovations; appraises the relative maturity of 
existing techniques; and cites present “problem areas” which are peculiarly attrac- 
tive to innovation. He then reaches some broad conclusions about future domestic 
oil reserves, insofar as they are technologically determined. He concludes that “the 
supply of domestic petroleum through 1975 appears to hold good promise of meeting 
demand [at approximately present prices] . . .” (p. 293). 

This study raises many questions. The author stresses the need for a method of 
prediction that is both “practical” and superior to the old techniques. The operability, 
as well as the meaning, of the one he suggests is at least questionable. Neither is it 
clear that guesses based upon guesses will be more useful than those orthodox tech- 
niques that the author eschews. Furthermore, the author does not present a sys- 
tematic method for synthesizing (and interpreting) disparate subjective estimates 
of different expected innovations. There are also some missing steps that make this 
treatment difficult to follow. For example, the analysis is directed primarily to es- 
timating reserves at some future time. But reserves is a stock which has been large, 
but not entirely stable, relative to yearly additions and withdrawals. The analysis 
of price requires some notion about the withdrawals from stock per unit of time, and 
the relationship to reserves is not spelled out. Finally, there are some observed facts 
that call the main conclusions into question, though they do not necessarily overturn 
them. There is evidently a great deal more to world petroleum supply than tech- 
nological progress, for domestic innovation has not so far proved a match for ad- 
vantages enjoyed by giant foreign fields that now contain the bulk of the world’s 
known oil reserves. In a free market, decisions made by many independent firms 
may be a better gauge of probabilities than any amount of interviewing of experts. 
In spite of restraints upon world petroleum trade, in recent years domestic oil has 
retained a declining share of the U. 8. market. The recent modest record of net ad- 
ditions to reserves as compared with past performance, and the reaction of domestic 
companies to it, suggest that the industry now has reason to be less optimistic about 
domestic oil prospe::ts than is the author. 

Nevertheless, this is an interesting book. The author brings to it an uncommon 
combination of skills and interests. He knows a great deal about petroleum tech- 
nology, and usually does a good job of communicating what he knows. Anyone who 





1 These notions, and related ones, are not wholly novel. Indeed, they have long enjoyed a certain popularity. 
Compare, for example, Report of the United States National Resources C ittee, T logical Trends and 
-National Policy, 1936; Hornell Hart, “Technology and the growth of political areas,” Technology and International 
Relations, ed. by William Ogburn, Chicago: University of Chicago Press, 1949. 

The predictability of invention has tly been questioned vigorously. John Jewkes, David Sawers, and 
Richard Stillerman, The Sources of Invention; New York: St. Martin's Press, 1958. 
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is interested in the petroleum industry can find something stimulating here. Although 
the data assembled are primarily from secondary sources, and the references favor 
the trade journals rather heavily, they form a good introduction to petroleum tech- 


nology. 


Distribution of Union Membership among the States, 1939 and 1953. Leo Troy. Oc- 
casional Paper No. 56. New York: National Bureau of Economic Research, 1957. Pp. viii, 
32. $.75. Paper. 


Maraaret 8. Gorpon, University of California (Berkeley) 


HERE has long been a need for statistics of union membership by state or region. 

In the absence of reliable data of this kind, it is difficult to determine what geo- 
graphical shifts in union strength are taking place or to analyze the factors responsible 
for these shifts. Yet the problems involved in the compilation of reliable data are 
formidable, since only two states collect statistics of union membership, and it is 
difficult to allocate membership accurately by states from the records that can be 
furnished by some two hundred national unions. 

The present report represents a valuable contribution toward filling this gap. The 
author was able to obtain records of local union membership from almost all national 
unions, as well as the cooperation and assistance of the national unions in determining 
how to allocate the membership by states. Because the job was time-consuming and 
costly, the estimates were limited to the years 1939 and 1953, but the results provide a 
basis for analyzing geographical differences in rates of growth during a period of 
rapid expansion in union membership. 

Despite their usefulness, the statistics must be interpreted with caution. The ab- 
sence of data on local independent unions represents an important gap, since the 
membership of these unions is unevenly distributed by states. Some of the other 
difficulties in interpretating the data are, if anything, understated by the author— 
the problems of defining union membership, of duplication of membership, and of 
allocating the membership of local unions which spread over several states, to men- 
tion only a few. 

In some respects, the difficulties presented by union membership data obtained 
from unions are analogous to the problems presented by employment data obtained 
from establishments. As Irving Bernstein has pointed out,' it would be extremely valu- 
able to have union membership data obtained from the decennial census, or at least 
from a census-type survey. Not only would problems of double-counting and dubious 
allocation by state or region be eliminated, but it would also be possible to anlyze 
the data by age, occupation, and other significant variables on which information is 
not usually available from union records, Clearly, also, the growth of union member- 
ship could be analyzed much more effectively if other states would inaugurate data- 
collection programs similar to those in effect in California and Massachusetts. 


Residential Finance, 1950. Richard U. Ratcliff, Daniel B. Rathbun, and Junia Honnold. 
For the Social Science Research Council in cooperation with the U. S. Department of 
Commerce, Bureau of the Census. New York: John Wiley and Sons, Inc.; London: Chap- 
man and Hall, Ltd., 1957,Pp. x, 180. $6.00. 


Sau B. Kuiaman, National Association of Mutual Savings Banks 


peearie who have had occasion to use the mass of detailed statistics on 
residential financing in Volume IV of the 1950 Census of Housing will welcome 
the summary and analysis of these statistics presented in Residential Finance, 1950. 





1 Irving Bernstein, “Measuring with a broken yardstick,” in Statistics of Labor-Manag t Relations, Institute 
of Industrial Relations, University of California, Berkeley, 1956, pp. 18-19. 
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This monograph is the seventh in a series of thirteen which have been, or soon will be, 
published on the 1950 Census of Population and Housing. 

The authors have done an admirable job of reducing over 600 pages of elaborate 
Census tables to 176 pages of easily readable text and summary tables. They have 
organized the vast amount of Census material into six well planned chapters (plus a 
brief introductory chapter) on over-all mortgage characteristics, lender practices, 
borrower characteristics, junior mortgage financing, financing of rental housing, and 
government mortgage guarantees. This arrangment and choice of subject matter is 
well suited to bringing out the main factual findings of the 1950 Census survey of 
residential financing. 

The reader should expect little more than a factual summary, however, for as 
the authors themselves warn “we have deliberately chosen to favor extensive treat- 
ment at the expense of intensive refined analysis (and hence cannot avoid) the tire- 
some recitation of fact inherent in extensive treatment. Detailed examination of a 
limited number of arbitrarily selected hypotheses would lend analytical elegance to 
our work ... (but) would run the risk of ignoring more important material.” This 
reviewer feels that there is a middle position which would have permitted some re- 
fined analysis in significant areas of mortgage finance within an over-all framework 
of extensive treatment. The “risk of ignoring more important material” is ever present 
in the work of the economic analyst and should not deter him from selecting, on the 
basis of knowledge and experience, those basic hypotheses which he considers worthy 
of intensive analysis. 

One of the shortcomings of adhering too closely to a recitation of Census facts is 
the absence in this monograph of any comparative analysis of pertinent data from 
other sources for years since 1950. Such analysis would have been particularly 
relevant in connection with some of the material in Chap. 3 on lender practices, not 
only because current data are readily available in this area but also because there have 
been such broad changes in mortgage portfolio composition since 1950 that the discus- 
sion of Census data is of historical interest only. Indeed, analysis of Census findings 
as if they had some currency, in a monograph published seven years after the date 
of reference, may tend to mislead the reader. 

In their discussion of mutual savings banks, for example, the authors state that 
“participation in both the FHA and VA programs is less extensive than in the case 
of commercial banks. . . . In terms of dollar volume, their holdings of conventional 
first and junior mortgages amount to two-thirds of outstanding mortgage debt” 
(page 26). Basic legal and economic changes since 1950 have markedly altered savings 
bank mortgage practices so that they have placed increasing emphasis on federally 
underwritten mortgages and in recent years have had a substantially larger pro- 
portion of their portfolios in these loans than commercial banks have had. Conversely, 
the proportion of savings bank holdings in conventional mortgages has been sharply 
reduced, and is now less than that of commercial banks. 

Similarly, the discussion of borrower characteristics in Chap. 4 would have been 
of considerably more interest and use if the 1950 Census results were compared with 
more recent findings of the sort included in the Federal Reserve annual Survey of 
Consumer Finances. Data for income and occupation of mortgage borrowers and for 
market value and outstanding indebtedness on owner-occupied homes, similar to 
those reported in the 1950 Census of Housing, have been available from the Survey 
of Consumer Finances for several years. Lack of complete comparability between 
these data might serve to qualify but not to invalidate conclusions. 

For areas in which later data are not available, it would have been useful for the 
authors to have commented on important market changes which have taken place 
since 1950 and which, in their judgment, have rendered obsolete for current analysis 
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some of the Census findings. Token recognition of basic changes was given in connec- 
tion with the discussion of junior mortgage financing: “Since the time of the enumera- 
tion of the 1950 census, there have been changes in the mortgage market which, no 
doubt, have altered the incidence of junior financing” (p. 70). Comments of this type, 
reinforced by brief analysis, would have been pertinent to several other sections of 
the monograph. 

A point on which I cannot resist comment, though it is minor in the framework of 
the monograph, is that the authors have failed, unfortunately, to make a conceptual 
distinction between techniques of mortgage acquisition and secondary market trans- 
actions. They have, in effect, accepted the notion, common among mortgage market 
participants, that the acquisition of any mortgage by an investor from an originator 
is a secondary market transaction even though the mortgage may have been newly 
originated on the basis of a firm commitment from the investor to purchase it. It is 
this special meaning which the authors have in mind when they cite the large volume 
of mortgage “purchases” by life insurance companies as evidence of their heavy de- 
pendence upon the secondary market. The bulk of such “purchases,” however, are 
not secondary transactions under the accepted economic concept of a secondary mar- 
ket—one in which trading takes place between investors in existing mortgages or 
securities. 

Most so-called “purchases” by life insurance companies merely reflect the in- 
direct origination of loans through agents or mortgage correspondents, a lending 
technique common to investors engaged in a nationwide mortgage program. In the 
economic sense, such acquisitions are primary market transactions just as are ac- 
quisitions of mortgages by life companies through their own branch offices, or direct 
origination of mortgages by lenders in local areas. The authors would have improved 
understanding of mortgage market techniques and processes if they had discussed 
the Census material simply in terms of methods of acquisition rather than in terms 
of secondary and primary market activity. It is not possible from Census data, un- 
fortunately, to determine the volume of “true” secondary market activity. It is 
generally known, however, that life insurance companies and mutual savings banks 
are more actively engaged in the acquisition of seasoned mortgages in secondary 
markets than are commercial banks or savings and loan associations. 

It would have been useful if the authors had discussed the reliability of Census 
data in cases where results clearly do not agree with common understanding of in- 
stitutional arrangements or market operations. The Census finding that mortgage 
companies are active purchasers of federally underwritten loans, for example, does 
not square with other information. It is not likely, either, in the light of recent find- 
ings, that they held as large a proportion of conventional mortgages in 1950, as in- 
dicated by Census data. (See my article on “Mortgage companies in the postwar 
mortgage market,” Journal of Finance, May, 1957). 

The relatively large percentage of high loan-to-value ratio conventional mortgages 
held by commercial banks and other institutional lenders, in some cases higher than 
permitted by statute, is admittedly unexplainable by the authors. Even allowing for 
liberal property appraisals, the reported finding is suspect that 22 per cent of all 
commercial bank loans and 15 per cent of all life insurance company loans, made to 
nonwhite borrowers were in excess of 100 per cent of purchase price. In the absence 
of discussion on statistical reliability, other Census findings not so clearly out of line 
are brought under some suspicion. 

The shortcomings of the monograph are not so many or so important as to impair 
its general usefulness. The authors have achieved their stated purpose, that of pre- 
senting an extensive summary of the mass of tabular material in the 1950 Census 
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volume on residential financing. They have covered a number of basic topics in a 
highly competent fashion and in a useful organizational scheme. As a reference work, 
this monograph will be a valuable tool for investigators. 


Consumer Instalment Credit, Part II, Vols. 1 and 2. Conference on Regulation. Board of 
Governors of the Federal Reserve System. Washington, D. C.: U. 8S. Government Printing 
Office, 1957. Pp. xxvi, 553; x, 161. $1.75; 0.60. Paper. 


Viota Wyckorr, New York University 


‘Too volumes are concerned with the case for and against selective control of 
consumer credit. The second volume takes up the pros and cons. The first volume 
considers consumer credit within our economy: its relation to economic stability and 
instability, wartime regulation, quantity and quality of consumer credit during the 
cycle, effect of general credit controls on consumer credit, borrowers, marketing 
practices, ete. 

The proceedings of the conference are composed of papers and comments by 34 
economists including Thomas Atkinson, Milton Friedman, Don D. Humphrey, F. A. 
Lutz, Geoffrey Moore, Robert Shay, and Arthur Smithies. The discussions are pro- 
vocative, and the first volume is comprehensive. 

The important questions considered in the second volume were: Does general 
credit control affect consumer credit? Are general controls adequate to control 
consumer credit? Is supplementary, selective control of consumer credit needed? 
Are adequate laws possible and feasible now? Would consumer credit regulation dis- 
criminate against individuals and enterprises? 

These questions were not settled. Concerning the first two question Eli Shapiro 
and David Meiselman stated, “The relatively fixed proportion of total bank loans 
that consumers and finance companies accounted for appears to contradict the asser- 
tion, at least for the 1953-55 period, that banks so favor providing funds to finance 
consumers that traditional weapons of monetary policy can have little effect on bank 
sources of consumer finance. On the contrary, banks respond to central-bank policy 
by altering the availability of funds both to consumers and to other borrowers, 
among them business borrowers” (1, 308-9). 

Marcus Nadler disagreed: “From 1946 to 1955, according to a recent Federal 
Reserve survey, member bank loans to sales finance companies increased twice as 
rapidly as all business loans. In addition, these companies were charged a lower rate of 
interest than any other business group. Even in periods of tight money, they [sales 
finance companies] have little difficulty in obtaining both short- and long-term funds, 
as was strikingly demonstrated in 1955” (2, 3). 

Practically everyone viewed the question of consumer credit regulation from a 
different angle, even when their conclusions were the same. Some economists agreed 
about arguments and facts but drew different conclusions, for example, Nadler and 
Lester Chandler. The latter opposed control and thought that “... attempts to 
impose controls on consumer credit might well weaken the prestige and power of the 
central bank to such an extent that it could not use effectively even its general re- 
strictive powers” (2, 30). 

There was little disagreement on the importance of consumer credit within the 
economy; its relation to economic stability or to inflation. Those who favored control 
based their arguments mostly on these considerations. Those who opposed it pre- 
ferred the operation of a “free market,” and thought that controls would be dis- 
criminatory. (Some economists were fascinated by the folklore on thrift and tried to 
disentangle their opinions as economists from their bias as savers or borrowers. 
Ethical considerations cropped up throughout the papers.) 
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The type of law which might best control consumer credit and how to administer 
it were referred to only in passing. The other shortcoming to this reviewer arose from 
the topic selected, that is from consideration of consumer credit separately from other 
types of credit, not as a part of all credit within the economy. Federal regulation, 
if concerned with quantity of consumer credit in relation to the economy, would need 
to dovetail consumer credit regulation with other credit regulation in order to be 
effective. State laws such as New York’s recent ones might well take care of quality 
of consumer credit and marketing practices. 

The arguments and material presented may not answer definitively the question of 
whether or not consumer credit should be regulated, but they do present powerful 
reasons why any agency might be reluctant to undertake the task. 


Consumption and Business Fluctuations. A Case Study of the Shoe, Leather, Hide Se- 
quence. Ruth P. Mack. New York: National Bureau of Economic Research, 1956. Pp. 
xvii, 293. $7.50. 


Danret B. Surts, University of Michigan 


I has always seemed to me that economic aggregation is largely a crude simplifying 
expedient to enable the economist to press forward his research in the face of data 
and method inadequate to handle the detail of underlying economic reality. It is this 
detail that is the stuff on which theory is properly built and the interrelations of 
which are of real scientific concern; aggregates are properly defined simply as the 
sum of the parts. Yet in our professional society prestige frequently seems to accrue 
in direct proportion to the extent to which detail is washed over by global totals, 
rather than the extent to which careful observation of many facts is welded into a 
unified structure. Thus a study of “Consumption and Business Fluctuations”— 
about as aggregative and global as you can get—is prestigeous, while the attempt to 
observe carefully, analyze and systematize the behavior of the “Shoe, Leather, Hide 
Sequence” must derive such reflected glory as it can by proudly attaching itself to 
the symbol of research prowess as a “Case Study.” 

I believe these values are backwards and wrong, and I think Ruth Mack believes 
so too, but I can see no other reason for the title of her book. It is not, I am happy to 
say,a study of “Consumption” or “Business Cycles” at all—at least not in the aggrega- 
tive sense in which these terms are understood. Rather, it is a thorough and patient 
study of the time pattern of the stocks, flows, and values of the shoe, leather, and 
hide industry. It is a detailed study centered on the idea that “we need to examine a 
wide variety of data not limited to the flow of goods and prices. We need to improve 
our ability to utilize these records for the purpose of observing and describing be- 
havior. We need to study objectives, techniques, evaluations and actions of business 
men, for only through such knowledge can useful hypotheses be formulated concern- 
ing the dynamics of interconnection among economic events” (p. 32). 

The organization of the study proceeds from a summary and statement of objec- 
tives to a brief description of the industry and its organization. This is followed by a 
cyclical analysis—more or less along lines familiar as standard National Bureau 
practice—of the time series of retail and wholesale shoe sales, leather and shoe pro- 
duction, and hide movement into sight, as well as retail and factory shoe prices, and 
prices of leather and hides. It is shown that while wholesale sales tend to lead, the 
remaining physical flows move together. The problem for study is the exploration 
of these facts—to explain them by careful reference to the behavior of consumers 
and producers at the several stages of production. 

The production and sales of the shoe, leather, and hide industry follow, of course, 
the general pattern of business fluctuations in the economy but particular interest 
is centered on the origin and propagation of industrial subcycles averaging about a 
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year and a half in duration. There were estimated to be over a dozen such cycles in 
the period 1922-1939 which the study covers. Comparison of the amplitudes and 
turns of these cycles at the several stages of production is integrated with study of 
business practices to give insight into the propagation mechanism. 

The path of the investigation leads directly upstream: chapters 5 and 6 are de- 
voted to consumer purchases, 7, 8, 9 and 10 to the role of the retailer, 11 and 12 to 
the leather buying behavior of the shoe manufacturer, 13, 14 and 15 to hide produc- 
tion, prices and tanning, and 16 and 17 to summaries of the findings. 

At each stage the history of the variables is referred to the relevant facts of business 
conduct. More often this is done qualitatively, but sometimes with the aid of re- 
gression analysis, and always with complete responsibility and full recognition of the 
limitations arising from method and data. 

As a contribution the study is a beginning rather than an end. It outlines the 
problems to be analyzed and points directions for next steps. This is not an easy 
book to read. Indeed the amount of positive effort which must be invested is well 
above average. Since the author is not a mathematical economist, both reader and 
writer must handle the analysis “in bulk” unaided by mathematical notation or 
equipment, yet the material dealt with surpasses in complexity and detail many more 
global models usually considered beyond the range of literary analysis. On the other 
hand the reader willing to invest the time and effort will be adequately repaid, for he 
will find himself treated, not as audience, but as peer and partner whose help is 
sought and opinions valued. 


The Measurement of Levels of Living with Special Reference to Jamaica. Colonial Re- 
search Studies' No. 24. C. A. Moser, Reader in Social Statistics, University of London. 
London: Colonial Office. Her Majesty’s Stationery Office, 1957. Pp. 106. 13s. 6d. Paper. 


Auice Hanson Jones, Washington, D.C. 


Fis concise booklet is much more than a presentation of selected data. It is a 

dexterous pilot study, sponsored by the Colonial Economic Research Committee 
and based on a nine weeks stay in Jamacia in the summer of 1955, of what is involved 
in a specific attempt to implement recommendations developed within the United 
Nations organizations.' The statistical methods are not involved, but the conceptual 
problems are intricate. Though focused on one under-developed colonial area at the 
present time, and though there are endless details which could be argued, this is a 
significant methodological and conceptual contribution of wide applicability. The 
author states his purpose “to examine, with reference to Jamaica, how best to measure 
changes in levels of living over time; to examine the adequacy of data available in 
Jamaica for this purpose; and incidentally to put together such relevant series as I 
could obtain.” 

A field study was not attempted; aid was received from the Central Bureau of 
Statistics in Jamaica, the Institute of Social and Economic Research at the Uni- 
versity of the Central West Indies, as well as from specialists both within and without 
the Government and the University. Moser’s return journey was routed via New 
York where he discussed problems related to his undertaking with experts in the 
Statistical Office of the United Nations. 

Interest of statisticians centers largely in parts II and III, the latter dealing with 
“The Main Methodological Issues.” A number of statistical series (the earliest be- 

1 Report of a Committee of Experts convened by the Secretary-General of the United Nations jointly with the 
LL.O. and the U.N.E.S.C.O., “Report on International Definition and Measurement of Standards and Levels of 
Living,” Document E/CN.3/179 and E/CN.5/299, New York, 1954, hereinafter referred to as the U.N. Report. 
For F.A.O. comment see Document E/CN.8/179. Add. 1, 18 March 1954; for W.H.O. comment Document E/CN. 


3/179. Add.2, 5 April 1954. See also ch. VII, “Social Action to Improve Levels of Living” in Robert E. Asher, 
Kotschnig, and Brown, The United Nations and Economic and Social Cooperation, Wash. D. C., Brookings Institu- 


tion, 1957. 
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ginning with 1879 and some shown separately for the 14 parishes into which the island 
is divided), some never previously presented, are given in some 50 tables in part II, 
“Case Studies: 1. Education; 2. Nutrition; 3. Health; 4. Housing; 5. Income, Ex- 
penditure and Consumption; 6. Concluding Remarks.” This part treats with nice 
discernment data limitations and problems of interpretation. Part I deals with 
“Concepts and General Approach” and part IV with “Future Work.” Anyone con- 
cerned with field work in or theoretical concepts of consumption economics should 
read the entire study, as it is difficult to do justice in a summary. 

Points of departure are the definitions and method suggested in the U. N. Report. 
These developed out of the problems of defining and measuring “levels of living 
(what is)” as distinguished from “standards (what might be)” or “norms (what ought 
to be)” and of making international as well as intertemporal comparisons of such 
levels.? The level (or levels) is considered as being made up of a number of components 
such as health, food and nutrition, education, conditions of work, and so forth through 
a list of twelve, and the measurement of each is first tackled individually, through 
various indicators. (The thorny matter of their combination into an over-all gauge 
is discussed in part III.) 

In parts II, III and IV he demonstrates the possibility of and necessity for compar- 
ing actual levels with standards (subjectively desired or perhaps based on achieve- 
ment elsewhere) or norms (physiologically determined or targets on other bases 
which may be set by international agencies or national governments). He takes ac- 
count of the interrelations between various components of levels of living and the 
interplay between levels and standards over time as well as in place comparisons. 

As crucial measures he seeks indicators, where possible classified geographically 
and by economic groupings (to yield distributions about the average) of (1) social 
resources, both physical and human (e.g. schools, hospitals, teachers, doctors) as 
well as of (2) their utilization (e.g. school attendance) and of (3) the end-product 
toward which they are aimed (educational level attained, adequate health). 

He suggests there is no point (no more than to combine capital and income) in 
combining indicators relating to resources with those relating to utilization or end 
results. For construction of composite indexes (separately) for resources, utilization 
and end products, he reviews the theoretically possible weighting alternatives. He 
suggests (p. 86) as the most reasonable basis “factor analysis” as used in American 
studies on farm level of living indexes, with consideration also of a technique of 
“discriminatory analysis” (p. 65). 

A quite different kind of combination of indicators of the first type more akin to 
national income accounting than to construction of index numbers is sketched (pp. 
87-8) as a perhaps more useful new tool. At regular intervals a “social resources 
account” (human as well as non-human), analogous to a national “balance sheet” 
could be drawn up for a country. Moser would reserve a place of priority for that 
“common measure of economic development . . . national income per head (at con- 
stant prices), but to get any idea of resultant improvements in social conditions and 
welfare one should surely go beyond it. This is the function performed by the many 
indicators proposed in the U. N. Report and the few I have covered in the present 
study. I now suggest that it might be worth trying to combine the resources indicators 
into a social resources account and, less hopefully, to assess the return obtained on 
these resources in terms of improved social conditions” (p. 88). 





2 Moser is concerned that possible confusion for a time may result from the agreement on the term “ievel of 
living” for what in British usage has commonly been called “standard of living,” but he follows the terminology 
recommended. He notes that at the 1956 meeting of the U.N. Statistical Commission “conditions of living” was 
suggested in place of “levels of living.” I am inclined to agree with him that this seems well worth considering. 
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Although he leaves many unresolved questions for further research, there is a 
nice, orderly marshaling of issues. Though he eschews specific recommendations, he 
cites many possible advances in administrative statistics at negligible or modest 
cost, and ways of enhancing partial surveys already programmed. 

He urges more intensive and informal field work than associated with modern 
questionnaire surveys, ideally “by a team of experts—-e.g. sociologist-statistician, 
nutritionist, doctor—aided perhaps by one or two high grade fieldworkers, who should 
be people experienced in social work or something similar.” He suggests that each 
household be studied over a period of a year to see conditions at their best and worst. 
“Allowing for ail the preparatory work, and analysis and reporting, a total period of 
three years would probably be necessary.” A detailed plan along these lines, linked 
with nutrition and preventive medicine programs might interest governments, re- 
search committees, foundations and the United Nations, particularly the Interna- 
tional Labour Organization. 


Statistical Sources for Market Research. The Market Research Society. Lingfield, Surrey, 
England: Oakwood Press, 1957. Pp. vi, 32. 10s 6d. $1.60. 


The Sources and Nature of the Statistics of the United Kingdom, Volume 2. Edited for 
the Council of the Royal Statistical Society by Maurice G. Kendall. London: Oliver and 
Boyd, 1957. Pp. viii, 343. 30s. 


Peter B. Kenen, Columbia University 


HESE two guides to British statistics are happy complements. The first is de- 
Tisiened to lead those of us who may have need of certain British statistics to the 
series we require. The second is designed to protect us against error in the use and 
interpretation of those statistics. 

The slender volume published by the Market Research Society is a classified list 
of the business statistics of the United Kingdom. It indicates the frequency, the form, 
and the place in which they are published and occasionally tells us something about 
the way in which they have been collected. It does not pretend to be comprehensive; 
the Society apparently takes a narrow view of market research, for it neglects en- 
tirely statistics on prices and on the banking system and the capital markets. What 
is more, it makes no attempt to appraise or compare the series to which it refers. 
But it does describe the principal sources in each of the areas it covers and it refers 
with pleasing frequency to the trade association and other nongovernmental data 
which the novice will not often discover on his own. It is indexed. 

The second of these two books is a collection of specialized monographs. These 
were first published in the Journal of the Royal Statistical Society but have in many 
cases been revised for inclusion in this volume. Nineteen essays describe and cri- 
tically appraise the available data on population; on the use of land; on the national 
income; on local government and regional economics; on banking, insurance and 
financial investment; on advertising; on medicine, medical services, education and 
crime; on international air transport and telecommunications; and on the clothing, 
food, petroleum, timber, and Scotch whiskey industries. They thereby fill many of 
the gaps left by the first volume in the Royal Statistical Society’s series. 

There are, however, three areas which deserve but have not yet received systematic 
treatment. Neither of these volumes thoroughly examines the available statistics on 
prices, on centrai government finances and the public debt, or on the balance of pay- 
ments. These subjects are surveyed elsewhere! but ought to have been included in 
this definitive study. They require and have not yet received the sophisticated treat- 





1 See, e.g., Ely Devons, An Introduction to British Economic Statistics. Cambridge: Cambridge University 
Press, 1956, and Edward Nevin, The Problem of the National Debt, Cardiff: University of Wales Press, 1954. 
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ment that characterizes this collection. There are, moreover, some serious gaps within 
certain of the articles in the Royal Statistical Society’s compendium. W. M. Dacey 
provides a superb analysis of the available data on Britain’s clearing (commercial) 
banks. Not only does he analyze the published statistics, he often adds to the data at 
our disposal. But he neglects entirely the discount market and the merchant banks. 
He also passes over the problems produced by recent changes in the clearing banks’ 
methods of valuation. John I. Mason and David Sachs, examining the statistics on 
financial investment, make no mention at all of The Economist’s excellent index of 
securities prices, nor do they mention the official statistics on new capital issues. And 
A. L. King, writing on petroleum statistics, might have said something about the 
finances of the industry, a subject of considerable interest to students of Britain’s 
balance of payments position. 

A few of the articles in this volume fall short of the high standard maintained 
through most of the book. J. E. G. Utting’s analysis of the national income statistics, 
for example, tends to degenerate into a catalog of the contents of the National Income 
Blue Book. He seems reluctant to comment upon the preparation or reliability of the 
data. Bernard Benjamin’s discussion of medical statistics is painfully complex. It 
is much more of an agenda for research than a survey of the materials available. A 
reader not already acquainted with the data he considers cannot expect to profit from 
a study of his article. 

But these blemishes do not much detract from the contribution which this volume 
and its predecessor make to our knowledge. They comprise a thorough and critical 
guide to the published statistics of the United Kingdom. The Editor and his associates 
are surely to be congratulated. If they would provide an index to their compendium 
we should be truly indebted to them. 


The Economics of the Fur Industry. Victor R. Fuchs. New York: Columbia University 
Press, 1957. Pp. x, 168. $5.00. 


G. R. Gregory, University of Michigan 
= the title suggests a work of considerably greater scope, this study was 
pointed toward a dual objective: (1) to find out what makes the fur industry 
“tick” the way it does, and (2) to throw some light on what the author feels are the 
two outstanding problems in the industry—short-run instability and long-run stagna- 
tion and decline. These more limited objectives are achieved: Moreover, they are 
achieved in a most readable fashion. 

To reach his first goal, Fuchs includes a concise but reasonably detailed description 
of the fur industry. Industry is interpreted broadly, including the production of raw 
fur skins on the one end and the retailing of fur garments at the other. The fact that 
uniform treatment is not given all segments of the industry proves to be no drawback 
since it is the little-known aspects of the industry that have been emphasized. Fur 
manufacturing, for example, is described in considerably more detail than is the re- 
tailing of fur garments. 

In accomplishing his second objective—-treating the problems of short-run in- 
stability and of long-run decline—Fuchs examines monthly time series from 1946 
through 1952 for evidence of nonseasonal cyclical movements (seasonal variation is 
obviously important and adjustments are made in the data to eliminate this effect). 
It is readily apparent that non-seasonal fluctuation in the fur industry is “fairly 
drastic.” It is less clear that 1946 through 1952 provides sufficient evidence to make 
meaningful the statement, “Fluctuations within the fur industry follow a discernible 
internal pattern with respect to amplitude, timing, and the transmission of impulses 
from one stage to another.” The conclusion that expectations play a leading role in 
causing the fluctuations, while not surprising, seems well founded. 
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The section on consumer demand, handicapped by a lack of statistical information, 
leans heavily upon the author’s apparent intimate knowledge of the industry. This 
knowledge is also put to good use in the discussion of the industry’s competitive 
structure. The conclusion that the highly competitive nature of the industry is one 
of its major handicaps seems well supported. The inclusion of a forthright set of rec- 
ommendations, while admittedly impossible to carry out because of legal, institu- 
tional, and psychological barriers, still provides the industry with a basis for self- 
improvement. : 

An outstanding feature of the book, and one which this reviewer would like to see 
employed more often, is the inclusion of an appendix containing much of the data 
used in the analysis as well as a description of the derivation of the time series utilized. 

The book should be of primary interest to members of the fur industry itself and to 
economists interested in industrial organization or in the theory of the firm or business 
cycle theory. It should also be received warmly by students of wildlife management— 
a field almost notable for its lack of economic study. 


Problems in the International Comparison of Economic Accounts. John W. Kendrick, 
Editor. Studies in Income and Wealth, Volume 20, The Conference of Research in Income 
and Wealth, The National Bureau of Economic Research, New York. Princeton, N. J.: 
Princeton University Press, 1957. Pp. x, 404. $8.00. 


Ep@ar O. Epwarps, Princeton University 


HE progress made in this book toward the improvement of international com- 
‘Daudanie of economic activity is almost as imposing as the conceptual and prac- 
tical hazards that remain. The subject matter covered in the six contributions ranges 
from an attempt to distinguish theoretically between economic and noneconomic 
activity to the construction of a comprehensive system of social accounts. Other con- 
tributors deal with index number problems, the reconciliation of different concepts 
of the government budget, treatments of government activity in social accounts, and 
internationa! transaction accounts. The critics whose incisive comments follow the 
various articles are often as thought-provoking as the contributors. Despite the un- 
even quality that is characteristic of most multiple-author works, the achievements 
of this volume are substantial. 

The most solid advance is scored by Morris Copeland, who constructs a compre- 
hensive system of social accounts in which he integrates domestic product and in- 
come accounts with balance sheets and flow-of-funds statements. Input-output data, 
requiring sectoring by establishment rather than by ownership, he would report in 
value terms in supplementary tables. The care with which he has related his system 
to the recently proposed UN system is itself no small accomplishment.' Too often 
such proposals are presented devoid of roots. 

The adoption of Copeland’s system of national accounts would involve substantial 
changes in the data presently reported for the United States. It would entail among 
other things (1) a switch from the national income concept to the domestic product 
concept (a move which Ruggles supports but which Jaszi questions); (2) a private 
noncorporate account which would, in common with the government account, in- 
clude both current and capital transactions on the grounds that decision-making is 
better analyzed in this way; (3) an account for existing fixed asset transactions and a 
contra entry in the private noncorporate account which would convert private non- 
corporate saving into net lending—a change that would permit the establishment of 
an account for financial transactions and the abolition of the saving and investment 
account. Most of these suggestions have as an aim the improvement of behavioral 





1 A System of National Accounts and Supporting Tables, United Nations, Statistical Office, Studies in Method 
Series F, No. 2, 1953. In the process Copeland subjects this document to the most searching review I have seen. 
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analysis. This is an important objective in its own right, but it must be conceded, 
I think, that its achievement rests on the intimacy with which the data on a country 
are related to its own institutions. It follows that the standardization of data and 
precision in behavioral analysis are likely to be incompatible goals. The purposes to 
be achieved by standardization involve instead welfare-rooted comparisons, a 
matter that is treated gingerly, when it is treated at all, not only by Copeland but 
by other contributors as well. 

To improve comparability between industrialized and underdeveloped areas, 
Copeland approves of more imputations and grants a temporary need for a rural 
sector. His standardized system emerges, however, heavily oriented toward indus- 
trialized countries on the rather optimistic assumption that “nonmoney communities 
will presently cease to exist.” Leontief in his comments advances the interesting 
hypothesis that much of the difficulty in getting comparable data on underdeveloped 
countries stems from just this kind of moneiary premise and the valuation problems 
it entails, rather than from a shortage of raw quantity data. The use of quantity 
data, he feels, should be further explored as a basis for a system of interlocking ac- 
counts before adopting a comprehensive system based on monetary values. 

Gerhard Colm in his article reviews some of his earlier positions on the treatment 
of government product in the national accounts. He continues to feel that any attempt 
to identify government intermediate product would give rise to larger errors than 
now exist in the United States data. On the other hand he has come to the conclusion 
that the imputation of interest on government capital would improve the accounting 
totals. Fabricant takes convincing exception to the first view arguing that if any 
amount of government product can be identified as intermediate the resulting esti- 
mates of gross national product would be improved. 

In developing the second view Colm suggests that the inclusion of imputed in- 
terest in national product would obviate his earlier proposal to include some of the 
actual interest payments of public bodies. The mingling of these two problems for 
whatever reason seems unjustified to this reviewer. The imputation of a return to 
capital remains a problem whether or not the money used for its construction is 
borrowed. Similarly the treatment of actual interest payments should be viewed 
independently of the uses to which the borrowed funds are put. The product for which 
interest is paid is the current use of purchasing power. One must examine the use to 
which the funds are put only to classify this product as final or intermediate. It might 
be decided, of course, to exclude the final product from the accounts on the ground 
that it arises from a redistribution of other more tangible products. But this argu- 
ment is quite different from a denial that there is any “current counterpart” to the 
interest payment which is Colm’s position. 

A useful and detailed reconciliation of the conventional, cash and Commerce 
statements of government receipts and expenditures is provided by Marilyn Young. 
Her contribution, a valuable and impressive reference document, is supplemented 
by the comments of Block and Landau of the United Nations who summarize some 
of the budget techniques used in other countries. The anachronistic nature of the 
United States treatment of government capital accounts is glaringly exposed. 

Working on the fringe of established systems of data, Herbert Wooley discusses 
the problems involved in elaborating a system of international transaction accounts. 
He is concerned mainly with how best to group nations (what purpose should domi- 
nate?) and how to handle multi-country petroleum, shipping, and merchanting 
transactions. Some of his transaction problems might be simplified if one could build 
on country accounts, with which the international ones should mesh, that are them- 
selves standardized whether on a national product or a domestic product basis. 

To those who use index numbers without seriously questioning the fundamental 
assumptions, the Brady and Hurwitz article should prove refreshing. The recent 
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multiplication of products, product qualities, and attached services (guarantees, for 
example) leads them to question the validity of matching one product with another 
for the construction of intercountry price indices. To avoid “the need for imputing 
identity to particular commodities,” they suggest “matching income levels at which 
the same percentage of income was spent on food, [necessities or other standards of 
equivalence].” Two groups of people that match in the selected standard are pre- 
sumably equally well off. If the hypothesis is accepted that the proportion of income 
spent by different groups within a country is a function of relative incomes, this 
technique might demonstrate, as Tobin suggests, that real income is the same all over! 
But given the unsatisfactory results achieved with index numbers, the hypothesis 
certainly warrants further testing. 

Starting from the “changelessness of man as a social being” and the universal 
“antithesis between ends and the limited means” Kravis searches for a means of 
delimiting economic activity. He proposes “(1) the rule of remunerated activities” 
and “(2) the rule of sensitivity to rewards.” The sensitivity rule, which receives the 
brunt of Hagen’s and Viner’s criticisms, is essentially a cross elasticity—the elas- 
ticity of time supplied to a nonmarket activity relative to the price of time supplied 
to a market activity. Thus if a rise in wages induces a man to spend less time in his 
garden relative to his other nonraarket activities (say, attending church), the garden 
work is an economic activity. But every activity, market or not, is only partly 
economic, and the economic-noneconomic ratio for any activity (say, garden work) 
undoubtedly varies substantially from one individual to another; where one decides 
“economic” stops and “noneconomic” begins can be likened to defining different 
shades of grey as either black or white. Yet the criterion, sharpened, may aid in 
identifying the extremes. 

The varied subject matter in the book would make it difficult for the average reader 
to use profitably were it not for the carefully prepared subject index, and the excellent 
summary written by John Kendrick as an introduction to the volume. All the con- 
tributors, experts in this field, would concur with his view that “comparisons of the 
national economic accounts can seldom produce wholly unambiguous meanings.” 
Can we hope that “standardization,” toward which this volume is a successful step, 
will not lead the “amateurs” to ignore the warning? 


Demographic Analysis: Selected Readings. Joseph J. Spengler and Otis Dudley Duncan, 
Editors. Glencoe: The Free Press, 1956. Pp. xiii, 819. $9.50. 


George J. Stotnitz, Indiana University 


HIs collection of sixty-three papers is a companion volume to the editors’ Popula- 
tion Theory and Policy, also published in 1956. The basis of division between the 
two readers is far from clear-cut but roughly the latter is oriented about the theoret- 
ical or welfare aspects, in general terms, of population analysis. The editors have 
sought to provide the foundations of such analysis in the present book, by illustrating 
demographic thinking in relatively “concrete” or “empirical” problems. The papers, 
all previously published, are arranged in seven groups: (I) Past and prospective 
growth and distribution of world population; (II) Mortality; (III) Fertility, nup- 
tiality, and replacement; (IV) International distribution of population and migration; 
(V) Internal distribution and migration; (VI) Population composition and utiliza- 
tion of human resources (including quality of population); and (VII) Selected re- 
gional studies. Contained in the last are descriptions, variously approached and for 
varying dates, of France, Italy, Germany, the Soviet Union, Brazil, Puerto Rico, 
Africa, Egypt, China, Japan, Indonesia, Southeast Asia and Burma. Brief intro- 
ductions are given before eath group of papers and a bibliography, taken largely 
from the postwar journal literature, is provided at the end. 
Alone or in combination, these readers are ambitious ventures by any standards; 
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they are easily the most ambitious ever undertaken in the field of population. The 
editors have aimed at an Analysis which “may be used by itself as a relatively self- 
contained basis for a course in demographic analysis, in combination with the com- 
panion volume for a course emphasizing both theory and research, or as a sourcebook 
of outside readings in conjunction with one of the standard texts on population prob- 
blems or methods of demographic research.” An apparent further aim, not stated 
explicitly, has been to accommodate the nonstudent seeking a convenient introduc- 
tion to the field. The editors have made a welcome attempt to overcome the continu- 
ing need for materials suitable to either purpose. 

How well they have succeeded in their “course” objectives will obviously depend 
upon teacher and class, but it seems safe to forecast that it will be the teacher rather 
than the student who will benefit in the main. Papers written by specialists for their 
peers are likely to presuppose context and the student will have to seek elsewhere for 
perspective and a sense of proportion. Although the editors’ introductions are gen- 
erally excellent, they are probably too concise and allusive to be very profitable to 
those who need them most. The decision to pay little attention to techniques is under- 
standable but will be considered by some to be unfortunate, considering the returns to 
be expected from even marginal efforts. The only current source of general instruc- 
tional interest which also provides an introduction to techniques is the text by Landis 
and Hatt, which is, however, almost exclusively concerned with American population. 

Pedagogical issues aside, both the specialist and nonspecialist should find much in 
these pages to reward their interest. As the above titles suggest, the range of topics 
is so broad that it may seem ungracious to ask for more. Yet there are some surprising 
gaps in emphasis. The remarkable mortality declines now under way in many under- 
developed areas—probably the most important single event in international demog- 
raphy since the war—receives only slight mention. The uninitiated reader is likely 
to come away with impressions on this subject which would be appropriate to the 
1930’s. The postwar recovery of fertiiity in Western nations is similarly neglected; 
the prevailing tone of most selections is more nearly that of 1940 than 1955. A decade 
of developmental efforts in the underdeveloped areas, in the face of accelerating 
population growth, finds no representation at all in these papers. The informational 
revolution stemming from the introduction of sampling methods in many parts of 
the world is overlooked. Paradoxically, many of the papers which seem most impres- 
sive in retrospect are in the sections on population quality and migration, although 
these areas have not been in the forefront of demographic investigation in recent 
decades. 

The statistician will find little of explicit technical interest in these pages but will 
surely be struck by the wealth of problems to which his skills could be profitably 
applied. To name a few: such questions as the relations between economic and fertility 
fluctuations among populations practicing birth control, proper methods of standard- 
ization under incomplete controls for residual heterogeneity, the effects of changing 
composition on vital trends and differentials among population aggregates, the 
stochastic and systematic factors relating cross-sectional and longitudinal data, an 
appropriate identification theory for distinguishing separate and composite effects 
on fertility, or the selective characteristics of migrant groups, have yet to register the 
impact of modern statistical approaches. 


The 1953 Pilot Population Census for the First Population Census in Sudan. Prepared by 
K. J. Krétki. Khartoum: Sudan Government, Ministry for Social Affairs, Department of 
Statistics, 1955. Pp. 183. 1£ Egyptian or 1£ Os 6d English currency. Paper. 

W. Parker Mavuupin, The Population Council 


Tos beautifully printed book is a detailed report on the Pilot Census in Sudan 
in which slight?y more than 52,000 persons in 10,511 households, in 9 sample 
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areas were enumerated. The detailed table of contents of almost 8 pages is indicative 
of the care with which this report has been compiled. These initial favorable impres- 
sions are strengthened by the presence of a map, photographs of local scenes, and of 
enumerators in various settings, s copy of the English and Arabic questionnaires or 
schedules used in the enumeration, ete. Also, the non-technical parts of the report 
have been written with an eye to attracting the interest of the reader and a number of 
examples of the unusual are given. The Pilot Census itself incorporated at least one 
novel feature in one sample area where a lottery was held for those participating in 
the Census, and emphasis was placed on the fact that prizes would be given only 
to those included in the Census and to those whose entries on the Census forms proved, 
upon investigation, to be correct. Moreover, the author has been quite frank in 
enumerating the problems encountered, and even in listing the failure to obtain a 
number of the objectives of the Pilot Census. 

The primary objectives of the Pilot Census were: (1) to determine variances in 
order to compute the sizes of samples necessary in different areas; (2) to test the 
questionnaire; (3) to determine whether “foolproof plans had been made for the 
translation, scrutiny and coding of the answers”; (4) to test the form of organization 
at headquarters, including machine analysis and preparation for it; (5) to test the 
existing Province and District organization; (6) to test the provisions made for carry- 
ing out the census in the field; (7) to test the various types of enumerators and super- 
visors available in a country with areas at such diverse levels of development and 
sophistication; (8) to determine the number of field personnel required and the 
time taken by different kinds of enumerators; (9) to test the efficiency and practicabil- 
ity of the proposed enumeration techniques; (10) to provide information on which to 
base a realistic census budget; (11) to provide information regarding optimum seasons 
of the year during which enumeration should be carried out; (12) to test the various 
arrangements for dealing with people speaking about one hundred different languages; 
(13) to determine the feasibility of using a pre-determined classification of items such 
as lists of tribes, languages, occupations, etc., and (14) to find out the truth about 
people’s superstitions and objections to being counted. 

The sampling plan of the Pilot Census was to obtain a sample from: 

11 areas of the country representing different types of settlement, modes of life, etc. 

4 Omodias within each ares (except in one town which was to be completely 
enumerated). 

16 Sheikhships within each sample area (except in the town). 

The sampling fractions for enumeration of households were planned as 1, 4, or 3, 
depending on the type of living arrangements found within each Sheikhship. 

The actual samples drawn deviated from the theoretical plan in several important 
respects: two areas refused to cooperate; the desired number of sample Omodias was 
obtained in only half of the 8 areas in which the Omodia was the sampling unit; 
in only half the areas were as many as 16 Sheikhships included; and the sampling 
fractions used were quite different from the plan. In some instances the reasons for 
failure to follow the plan were given, and were understandable, but in other cases the 
stated reasons were trivial, and in still other cases no reasons were cited. The failure 
to follow a prearranged plan as closely as possible is important for a number of rea- 
sons, not the least of which arises out of the fact that after the samples had been 
drawn and the field work completed a determination was made as to “What is the 
exact area of which each sample can be deemed to be representative?” We are told 
that “no harm was done and perfectly homogeneous and reasonable areas were ar- 
rived at, except perhaps in the case of Jebels and Kababish.” Unhappily, this part of 
the report is poorly organized, and it is time-consuming to find the various parts that 
explain the plan and the results. Details regarding setting boundaries for areas of 
which the sample was deemed to be representative are not available to the reader. 
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The basis of sampling within a Sheikhship was (1) to make a rough approximation 
of total population within the Sheikhship on the basis of available lists (tax, tribute, 
heads of households, etc.); (2) furnish the enumerator with a list of persons to be 
enumerated, and (3) instruct the enumerator to enumerate a specified number of 
households nearest to each of the listed names, or to enumerate all persons within the 
area chosen for the taking of the subsample. The estimation procedure was based on 
the assumption that listed and nonlisted households were randomly distributed 
within the inhabited areas. The validity of this assumption is highly questionable, 
and unfortunately there is no indication that any attempt was made to determine 
whether or not the bias in such a procedure was small or seriously large. In areas 
where the “nomadic technique” was employed, for example, the following instruc- 
tions were given. “To keep the expenses within limits and to cut down the amount of 
travelling and walking to reasonable proportions, the general instructions to enu- 
merators were to go to the largest congregation of members of the given Sheikhship, 
usually at a watering place. There he was to enumerate as many followers of the given 
Sheikh as he could lay hands on.” The author states that “omissions from the tax 
list are random,” but offers no evidence to support this assumption. If the Sheikhship 
extends beyond the watering place at which enumeration was carried out, there is the 
further assumption that the ratio of non-listed households to listed households is 
the same for all other areas in the Sheikhship as that found at the place of enumera- 
tion. This procedure does not live up to the author’s claims for sampling: “Where 
people lead a nomadic mode of life, complete coverage could never be guaranteed 
and double counting could not be avoided during a full count census. With a sample 
census these problems can be dealt with satisfactorily.” 

But in spite of serious deficiencies of the types mentioned above, this may be a 
useful guide for those working in underdeveloped areas such as the Sudan. The re- 
port is quite frank in listing its “failures,” such as: unsatisfactory classification of 
tribes, languages, and occupation; procedure for listing heads of households; the 
post-enumeration survey which was not well planned, and did not produce satisfac- 
tory results; occasional failure to ask about temporary absentees; translators not 
being consistent in translating some words; etc. It is questionable whether or not the 
cost figures shown in this report would be helpful in another country, but certainly 
they will be of use in the Sudan. The discussion of nonsampling errors is perfunctory 
and unenlightening. 

If one attempts to assess the accomplishments of the Pilot Census relative to the 
stated aims, the resulting score sheet may be somewhat discouraging. In part this is 
due to the fact that the aims were more extensive than they should have been. In 
spite of the deficiencies of the study, examples of which have been illustrated above, 
The Pilot Census yielded much valuable information for the Sudan, and, in addition, 
the description of the practical problems and solutions attempted will be of consider- 
able value to persons attempting sample surveys or censuses in similar areas. 


Information and Control. Leon Brillouin, Colin Cherry, and Peter Elias, Editors. Volume 1, 
No. 1, September, 1957. New York: Academic Press, Inc. Pp. ii, 89. $10.00 annual sub- 
scription for 4 issues. 


— to its own statement, Information and Control proposes “to publish, 
papers which make significant contributions to the theories of communication, 
computers, and automatic control, and also papers which present experimental 
evidence or theoretical results bearing on the use of ideas from such theories in any 
field to which the ideas are relevant . . . it will from time to time publish editorials, 
or papers of an editorial nature, discussing views of the relationships between fields 
. .. the editorial board is international in character. 

The first number contains nine short articles, six of which are by United States 
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authors and the others by French, Swiss, and English, but all are written in English. 
The longer articles have short abstracts. 


D. D. F. 


Operations Research for Management, Volume II. Joseph F. McCloskey and John M. 
Coppinger, Editors. Baltimore: The Johns Hopkins Press, 1956. Pp. xxxvi, 563. $8.00. 


W. W. Coorprr, Carnegie Institute of Technology 


7 book is a collection of writings by persons active in OR—operations research— 
and related areas. Prior to their selection, all papers were presented at The Johns 
Hopkins University Informal Seminar on Operations Research during 1953-54 and 
1954-55. The book is designed as a series of vignettes. Subtitled “Case Histories, 
Methods, Information Handling,” it attempts to provide material for a picture which 
will enable “management” to read meaning into the multifarious forms which opera- 
tions research might assume. 

Three common purposes which such “collected writings” might serve in the litera- 
ture of science may usefully be noted. First, as in the case of the A. Wald or R. A. 
Fisher collections, this device might be used to bring together a series of contributions 
from one person (or a team) so that they might be better understood and evaluated as 
a coherent whole. Second, more or less distinguished contributions by independent 
authors might be organized under one topic or theme and thereby demarcate some 
field of endeavor and supply an advanced starting point for those persons who wish 
(a) to obtain access to these results for their own work without being required to 
trace a labyrinth of specialized literature or (b) to attain a jumping off point at the 
frontiers of a series of developments. The Cowles Commission Monograph, Statistical 
Inference in Dynamic Economic Models, edited by T. C. Koopmans is an example. 

The (perhaps) extreme examples thus cited, with their emphasis on outstanding 
research problems, have been deliberately selected to distinguish another purpose 
which such a collection might serve: explication for a lay audience which either will 
not or cannot enter the professional literature. It is this third category which the 
present book attempts to serve (at least by title and stated intent). 

OR is a newly developed field. There is as yet no consensus on its form, content or 
specific areas of application. This may have tempted the editors to enter the second 
(as well as the third) category of purposes. An indication of this is apparent in the 
changed emphasis in the editors’ introduction to this volume—the second to appear 
in a series of three promised volumes under the same title. The extension was prob- 
ably also unavoidable (a) because of the rather considerable progress in operations 
research between the appearance, in 1954, of the first volume and manufacture of the 
second and (b) because the presentations incoporated in this volume were intended 
for a seminar with a rather healthy representation of experts. The book is as much 
an outgrowth of these forces as of a process of editorial selection and pruning. The 
resulting mixture makes it necessary to balance and assess the accomplishments of 
explication against the quite different requirements of technical excellence and com- 
pleteness. Both are attenuated. 

Nevertheless, the book can be recommended—even recommended highly—to the 
following two kinds of persons: (1) those who might want “to buy” operations re- 
search and (2) those who might want “to do” it—or find out whether they have been 
doing it. This recommendation carries with it a reading prescription. Both groups 
should read the book rather rapidly. This will provide perspective on what OR is 
about and, more particularly, the nature of the problems it has dealt with, its phi- 
losophy, and the tools it has utilized. (They may also find that they are less likely 
to draw a picture of what OR might be able to do—e.g., in as yet untried fields and 
problems—and where it is likely to go in the future.) 








622 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1958 

With few exceptions (e.g., Merrill Flood’s piece on “The travelling salesman 
problem”)! those who treat this as a category 2 book will be disappointed, especially 
if they believe that careful study of its contents will provide an advanced starting 
point from which they may either (a) adapt OR “findings” for use in their own field 
of interest or (b) jump to the frontiers of operations research. Such persons had 
better look to the professional journals, such as Operations Research or Management 
Science. Fortunately, these journals are recent in origin so the labyrinth is rather 
easily traversed. If some kind of preliminary “rounded” introduction is also desired, 
these persons will find a better book in Introduction to Operations Research (New 
York: John Wiley and Sons, Inc., 1957), a one-volume compilation edited by C. W. 
Churchman, R. L. Ackoff and E. L. Arnoff. 

The merit of the present volume is for those who seek a rapid survey. They will 
find the book divided into four main parts of uneven length and emphasis: (1) 
Introduction, (2) Case Histories, (3) Methods, and (4) Information Handling (small 
group psychology). Much, but not all, of this material has been published elsewhere, 
especially in Operations Research. 

The Introduction, consisting of an article “From Plato to the linear program” by 
the Earl of Halsbury needs to be read twice in order to allow hackles raised on a 
first reading to settle on the second. One way of regarding this piece is as an illustra- 
tion of an often encountered point of view which divides (presumably) theughtful 
and equally well educated people into two groups: (1) those who are “narrow tech- 
nicians” and (2) those who are not. Apparently, the Earl of Halsbury, or at least this 
article, belongs in the latter category. Certainly it is sweeping in scope. But almost 
every “technical” point—the ventured interpretation of Plato, animal and human 
ecology, the characterizations of J. M. Keynes, and an “application” of linear pro- 
gramming—may, at best, be characterized as a whopping half truth and, at worst, 
as misleading, ill-conceived, or erroneous. Nevertheless, the main conclusions are 
correct (or at least compelling) and display a grasp of the nature (and history) of 
scientific evolution. The point is made that OR is only a recent manifestation of the 
grand sweep of this evolution from ancient to modern times. Improvement in meth- 
odology is gradually (and in accelerating fashion) allowing science to break from its 
laboratory confines and confront those tasks, in the world of practical affairs, which 
the classic philosophers originally conceived as objects of rational inquiry. Those 
besetting problems of society which have not yet yielded to any other approach (force, 
philosophy and religious belief) will, it is hoped, prove more tractable to these emer- 
gent methods of science. It should be noted, however, that the scientific task as thus 
conceived involves more than acquiring substantive knowledge. It also requires the 
evolution of adequate means for dealing with the huge scope and size of such prob- 
lems (already apparent in many OR areas) involving thousands and hundreds of 
thousands of variables, equations, and related parameters. 

Part 2, Case Histories, is a potpourri, as it should be. It ranges over applications in a 
variety of fields: military, including an historical resume (by Leroy Brothers), in- 
dustrial (e.g., Dunlap and Jacobs’ analysis of dragline operations in mining phos- 
phorous), and local government (e.g., L. C. Edie’s analysis of traffic delays for the 
Port of New York Authority). ‘These applications are supplemented by further pro- 
posed applications in each area. The latter include the reported studies by Roger 
Crane (Analysis of a Railroad Classification Yard), William Vickrey (Revising New 
York’s Subway Fare Structure)* and Stanley Davis (A Study of Combat Stress in 

1 The articles by Christie and Luce form another exception. 

2 Readers should be warned not to confuse Vickrey’s equations or methods of estimation with those that might 
be founded on statistical methodology. Joint-cost problems, related traffic demands, and attendant difficulties ap- 
parently led Vickrey to an informal “jud ting” process for deriving his parameter estimates and 


to an informal model (not completely specified in the article) for interpreting his results. The “team approach” is 
often praised in operations research as a device (a) for bringing the talents of experts in different disciplines to bear 
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Korea). The tools utilized include statistics (with relatively little emphasis on sam- 
pling and experimental design), probability (including stochastic processes), mathe- 
matics, and a variety of simulation and approximation techniques. The concepts are 
also drawn from a variety of fields—business, economics, physics, etc. 

The next pari of the book, Methodology, again covers a range of topics. Some parts 
are simply attempts, at an elementary level, to acquaint OR workers with available 
methods. W. J. Youden’s exposition of experimental design is an example. Others 
represent responses to the practical need for methods of solutions. As in Glen Camp’s 
“Bounding the solutions of practical queuing problems,” these are attempts to deal 
with a class of problems whether encountered in OR or elsewhere. Still others are 
direct responses to situations confronted in an OR context. R. Zimmerman’s “Monte 
Carlo model for military analysis” and Walter Cushen’s “Operational gaming in 
industry” are cases in point. Finally, some attempt is made to balance the preceding 
materials by articles pointed toward new areas and possible problems of the future. 
Thus, D. B. Hertz and G. J. Feeney jointly essay a statement of a formal model for 
an OR application to the structural design of managed organizations and C. W. 
Churchman poses the question of whether OR practitioners (collectively and in- 
dividually) should accept the fundamental value judgments of their clients, cr 
whether they should subject these judgments to OR inquiry. Neither paper gets very 
far. But the Pandora’s box that Churchman opens cannot be closed without disap- 
pointing, in important ways, the levy that the Earl of Halsbury would ultimately lay 
on scientific inquiry. 

The final section of the book, Information Handling in Organized Groups, consists 
of a series of reports by L. 8S. Christie and Duncan Luce. Much of the material rests 
on work done at the M.I.T. Research Laboratory of Electronics and reported else- 
where—e.g., in the M.I.T. Reports (1952) “Communication and learning in task 
oriented groups” by Christie, Luce and J. 8. Macy, Jr., and (1954) “Information 
flow in task oriented groups” by Christie, Luce and D. H. Hay. Some updating is 
added, however, including discussion of findings by others such as Guetzkow and 
Simon. Emphasis is on laboratory experiments where small groups are allowed to 
communicate with each other over certain network channels and are given certain 
simple tasks to perform. Assessments are made of efficiency (time to perform task) 
learning (improvement in repeated trials) morale (reaction of group members to 
tasks and organization) and other psychological factors. Originating in the constructs 
of group psychology (e.g., as envisioned by the late Kurt Lewin and, more immedi- 
ately relevant to the reported studies, the suggestions of Alex Bavelas and Harold 
Leavitt) these studies extend into other areas as well: mathematics (topological 
analysis of graphs and networks), statistics (experimental design), machine simula- 
tion and analysis, information theory extended to semantic content, etc. 

The reports in this final section are highly suggestive and appear to contain many 
promising insights into the area of organizational analysis and prescription—an 
area which OR is now only beginning to plumb. The studies are not yet operations 
research, however, and will not become so until means are found for carrying these 
insights beyond the laboratory. While he should be prepared to use new scientific 
knowledge (from a variety of fields) as it becomes available, the OR practitioner is also 
finding that he must be equally prepared to go ahead of this knowledge as his prob- 
lems and opportunities require. Indeed, as he does this he can provide a service to 
science as well as to management and thereby lL. 'p to better the state of both. The 
area covered by Christie and Luce may well become a case in point judging from re- 
cent trends in operations research. 





on particular problems and (b) for stimulating research in these disciplines. The difficult nature of the statistical 
(and mathematical) problems encountered in this study might naturally have suggested this kind of approach to 
Vickrey but there is no report of any consideration having been given to it. 
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A significant fact in the growth of operations research is that it is providing a body 
of practitioners devoted to the entire range of management problems and possessed 
of the ability to use analytical concepts and disciplined scientific methods in attack- 
ing these problems. Statistical methodology stands in the mainstream of the po- 
tential developments which may emerge from this growth. To those who desire it, 
this book provides an opportunity to acquire some “feel” for the course that such 
developments might take. 


The Rationalization of Industrial Decision Processes. Engineering Experiment Station 
Bulletin No. 163. William T. Morris. Columbus, Ohio: State University College of Engi- 
neering, 1957. Pp. iv, 117. $2.00. Paper. 


Cuarues C. Hout, Carnegie Institute of Technology 


— bulletin, originally prepared as a Ph.D. thesis, is concerned exclusively with 
normative theories of decision making, particularly those that might be applied 
to “engineering” or “industrial” problems. Morris is critical of the older literature in 
the field of engineering economics which does not get far beyond interest calculations. 
He sees great hope, as does the reviewer, in the new work being done on decision 
theory and his book is an effort to make this theory available to engineers. 

The chapters deal with: introduction and history, rationality for engineering de- 
_ cisions, the measurement of values, theorems for probabilistic decisions, decision 
making with incomplete information, judgement, and the timing of decisions. 

The “axiom of long-run success” the maximization of expected utility is presented 
as an “axiom which appeals to the engineer.” This version of rationality serves as the 
basic orientation of the book. Decision problems in which there is an absence of 
knowledge concerning one or more of the following: utilities, probabilities, possible 
states of the world, or possible action alternatives, are transformed through observa- 
tion and learning until they are amenable to rational solution under the axiom of 
long-run success. 

The courage to undertake such a task in 112 pages can only be admired, but the re- 
sult is a rather cryptic, eclectic, and largely mathematical exposition. Because the 
exposition is abstract with few illustrative applications, the book is not likely to serve 
well as an introduction to practicing engineers who are unfamiliar with the field. An 
exception is the chapter on “Judgement in engineering decisions” which should be 
immediately useful. 

For statisticians the book may be of interest fer its broad coverage in a few pages 
and its references to the literature. 


The Computing Laboratory in the University. Preston C. Hammer, Editor. Madison: 
University of Wisconsin Press, 1957. Pp. xv, 235. $6.50. 


Franz L. Aut, National Bureau of Standards 


srs volume contains the papers presented at a conference held at the University 
of Wisconsin in 1955 for the purpose of discussing problems encountered in es- 
tablishing computing laboratories in universities. Since electronic computers have 
come into more widespread use, a number of universities plan to install computers 
as an aid to teaching and research. The articles presented here illuminate many of the 
results and obstacles to be expected. 

The book begins with three general papers, by C. A. Elvehjem, J. H. Curtiss, and 
J. W. Forrester, which should be profitable reading for the university president or 
provost concerned with over-all planning of a computer installation. “Don’t expect the 
laboratory to pay its way through computing services to industry. Do attach it to 
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the Department of Mathematics, but do make the computer available to members of 
other departments without inconvenience or cost. Don’t neglect applications in the 
social and biological sciences. Do acquire large digital machines before analogue ones.” 
Subjects of this nature are discussed, though not, of course, in these words. 

Ten papers about applications of automatic computation in science and industry 
barely scratch the surface of the field. Some of the authors concentrate on the prob- 
lems solved by them or their laboratories, rather than surveying all computer ap- 
plications in a given field; and some important applications have been allowed to fall 
between two chairs. We miss, for instance, descriptions of crystallographic X-ray 
diffraction analyses, of radiation penetration (shielding), of optical ray tracing, of 
elastic or molecular vibrations, all of which are bywords in automatic computation. 
We are grateful, on the other hand, for thorough surveys of computing in astronomy 
(W. J. Eckert), meteorology (P. D. Thompson), physical chemistry (J. O. Hirsch- 
felder), and for a detailed enumeration of computer applications in the aircraft in- 
dustry (H. 8. Wolanski). A more complete survey of computer applications would be 
useful to prospective founders or managers of university computing laboratories, 
since a main mission of such a laboratory is to educate other university departments 
in the use of computers. 

Four papers are devoted to “Future demands for personnel in computing,” a ques- 
tion which hardly needed a more elaborate treatment than this. The following group 
of papers on “Curriculums for computing” are what the department chairman really 
needs. In two detailed and factual papers by G. E. Forsythe and V. C. Rideout, and 
two briefer ones by C. W. Adams and D. M. Young, he is given not merely lists of 
courses but advice on the activities and general attitudes which will promote the 
ultimate purposes of his computing laboratory. 

There are five papers on “Equipping a university computing laboratory,” whose 
competent authors find more than enough space for saying all there is to be said on 
this subject, and in addition branching out into the philosophy of the laboratory, 
occasionally with contradictory conclusions, as is only fair. “Organizing and financing” 
is similarly treated by four writers, each of whom principally describes his home 
laboratory. 

The last chapter of the book, under the headiag “Computation in perspective” 
contains a single paper by J. H. Van Vleck, entitled “Dangerous Gulfs: Some Re- 
flections on the Social Implications of Computing Machines.” It presents a deepgoing 
analysis of scientific-engineering, economic, sociological and cultural aspects far 
transcending computation. The time for reading these few pages is well spent; the 
reader is rewarded with startling insights inte some of the present trends in our civil- 
ization. dj 


Work Sampling. Second Edition. Ralph M. Barnes. New York: John Wiley and Sons, Inc., 
1957. Pp. x, 283. $7.95. 


W. Grant Ireson, Stanford University 


rs M. Barnes has assembled in this small book a working manual for those 
who have need of sampling techniques to obtain reliable information about 
working time, performance rating, indirect incentive measurement, demand, and 
allowances. The book is divided into three parts. The first part is devoted to the his- 
tory, fundamental statistical techniques, and procedures for making a work sampling 
study. Part II presents sixteen applications of work sampling studies to a wide variety 
of problems. Through these applications, Barnes shows that work sampling is a 
rapid and economical means of obtaining the information necessary for such diverse 
purposes as setting work standards for indirect labor, improvement of work methods in 
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nonrepetitive tasks, determination of amount of work to be performed, and the set- 
ting of allowances for different kinds of work. He introduces the use of the Waiting- 
Line theory as an aid in planning for most economical use of personnel and equip- 
ment, and shows how the information for the application of these theories can be 
secured through work sampling. Part III introduces work sampling techniques for 
performance measurement and rating and the determination of the validity of the 
results. It also contains a complete reproduction of L. H. C. Tippett’s, “A Snap- 
reading Method of Making Time Studies of Machines and Operatives in Factory 
Surveys.” This was the original paper, published in the Shirley Institute Memoirs in 
November, 1934, from which the whole field of work sampling or ratio-delay has 
grown. 

The book has been written primarily for the engineer who has little or no statistical 
background. Chap. 3 introduces the concepts of probability, normal distribution, 
confidence levels, and absolute error in the most elementary forms with very little 
explanation of why these concepts are applicable to the subject. The standard de- 
viation of the binomial distribution is presented, eook-book style, as a means of setting 
the confidence level, and is used to determine the number of observations necessary 
to obtain the desired confidence level. To save the reader from computing the number 
of observations for a given confidence level and from the recomputation of the actual 
confidence level obtained after the observations are completed, Barnes includes in 
Chap. 5 both tables and charts from which these values can be obtained. He also 
includes both tables and alignment charts for estimating the absolute error. 

The Shewhart control chart concept is explained briefly in Chap. 4, and the p 
chart is used as a means of testing for homogeneity among subgroups of observations. 

The author emphasizes the importance of randomness in the work sampling pro- 
cedures and provides several pages of random digits. In Chap. 7 he explains in con- 
siderable detail how to use the random numbers for randomizing any or all of the 
factors involved in a work sampling study. 

One of the interesting parts of the book is Chap. 22, in which Barnes reports the 
results of 14 different studies in 8 different companies. The purpose of these studies 
was to develop a sampling technique that would give an accurate and economical 
method of determining the performance level of labor activities. Statistical techniques 
used in the study included the Chi-Squared test for normality of data, Tchebycheff’s 
Inequality, and the Central Limit theorem. 

While the book may be of little interest to statisticians, it should be valuable for 
industrial engineers in two ways. It presents a clear and interesting description of 
work sampling as a tool. Secondly, it illustrates how statistical techniques can be 
used by industrial engineers to reduce their work and improve the reliability of their 
recommendations and conclusions. 


Théorie des Fonctions Aléatoires, Applications 4 divers phénoménes de fluctuation 
A. Blanc-Lapierre and Robert Fortet, with a chapter on fluid mechanics by J. Kampé de 
Fériet. Paris: Masson et Cie, 1953. Pp. xvi, 693. 6000 fr in paper; 6500 cloth. 


Leo J. Tick, New York University 


T= volume, published as part of a “collection of mathematics texts for the use 
of physicists,” represents a complete survey of the probability aspects of random 
function (stochastic process) theory and applications at the time of publication. 
Roughly, the book is divided into four parts. The first part contains some introduc- 
tory material with physical background, and probability foundations. (The last 
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chapter, really an Appendix, is a brief survey of measure theory, group theory, and 
others which are used in the text.) The second portion treats of processes with in- 
dependent increments, including Poisson processes and processes derived frorn them. 
This is followed by an extensive study of Markoff chains and processes. The book con- 
cludes with a broad treatment of stationary processes. 

The use of random processes, as a physical science model, has had tremendous 
growth in the recent years. Unfortunately, the written material available in English 
has not been such as to provide a good introduction to an interested novice. It has, 
mostly, been of two varieties. (1) Those papers or books in which some particular 
physical problem is posed and solved. The unity of the methods used is not shown 
and the techniques are not related to any basic probability knowledge a reader might 
have; often the whole spirit of modern probability theory is missing. (2) Other papers 
or books written by the purely inclined mathematician in which intensity is purchased 
at the price of extensity. Very little connection with physical models is displayed. This 
makes it quite difficult for the physicist or engineer to read. This latter criticism is 
less true of Markoff and Poisson process work, but it is true about stationary proc- 
esses. The latter is one of this reviewer’s main interests so he should be forgiven for 
stressing this point. 

Taken as a whole this volume is a rich source of material on the theory of random 
processes viewed from the standpoint of physical applications, at least up to the time 
of publication. This last point should not be taken too seriously even by a current 
reader. There really have not been too many advances for the intended audience, i.e. 
those interested in applications. In fact, as will be pointed out below, there are some 
topics in stationary process theory which have hardly been treated at all in the 
English literature. 

This book is a fairly long one, and, instead of trying to discuss all of it, it seems, to 
this reviewer, better to dwell on those aspects which distinguish it, for good or ill, 
from other works. One of those most striking features is the completeness with which 
many of the elementary points are treated. This is good, as these are often the parts 
that the novice finds most difficult, particularly one new to probabilistic thinking. 
The authors are not adverse to diagrams and writing out each step. In contradistinc- 
tion, some readers have pointed out that obtuse and difficult points are lightly passed 
over. Perhaps the physical science audience will not be overly concerned with these 
deeper aspects of the subject. Another feature which this reviewer found most re- 
freshing was the explicit discussion of vector and multidimensional (time or space 
parameter) cases. It has always been most frustrating to read mathematical works in 
which the author states that generalization of two or more dimensions is immediate 
and obvious. Too often it is neither. This reviewer especially enjoyed the treatment 
of stationary processes. The introduction of the spectral decomposition through the 
study of linear filters is especially good. The recent discovery that scattering of radio 
waves from the troposphere or ionosphere can be a reliable means of over the horizon 
communication, has created a great interest in multidimensional processes. There is 
very little in the English probability literature beyond the type of statement given 
above. However, there are a large number of symmetry conditions possible in higher 
dimensions and they bear study. The reader will find explicit treatment of these 
processes in this work in the study of polarized light and turbulence. 

This is a probability text; there is no study of statistics except for a few pages 
on the estimation of spectra. From the above the reader has no doubt concluded that 
this reviewer commends this volume as an introduction te random functions for the 
physical scientist or anyone else. 
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Sons, Inc., 1957. $6.00. 

Brown, J. Douglas, and Harvison, Fred- 
erick. High-Talent Manpower for Science 
and Industry. An appraisal cd policy at 
home and abroad. Princeton, New Jersey: 
Industrial Relations Section, Princeton 
University, 1958. $3.00. 

Chamberlin, Fdward H. The Economic 
Analysis of Labor Union Power. Washing- 
ton, p. C.: American Enterprise Associa- 
tion, Inc., 1958. $1.00. Paper. 

Davenport, Wilbur B., Jr., and Root, 
William L. An Introduction to the Theory of 
Random Siganls and Noise. New York: 
McGraw-Hill Book Co., 1958. $10.00. 

Dorfman, Robert, Samuelson, Paul A., 
and Solow, Robert M. Linear Program- 
ming and Economic Analysis. New York: 
McGraw-Hill Book Co., 1958. $10.00. 

Goodman, Richard. Teach Yourself Sta- 
tistics. Teach Yourself Series. London: The 
— Universities Press, Ltd., 1957. 7/6. 

renander, Ulf, and gay Gabor. 
Toeplitz Forms and Their Applications. 
California Monographs in Mathematical 
Sciences. Berkeley and Los Angeles: Uni- 
versity of California Press, 1958. $6.00. 

Hunt, K. E. Colonial Agriculture Sta- 
tistics: The Organisation of Field Work. 
Colonial Research Publication No. 22. 
London: Her Majesty’s Stationery Office, 
1957. 14s. Paper. 

Johnson, N. L., Compiler. Statistical 
Ezercises. P -rt II. Analysis of Variance and 
Associated Techniques. London: Dept. of 
Statistics, University College, 1957. 12s. 


Paper. 

Kendall, M. G. A Course in Multivariate 
Analysis. No. 2 of Griffin’s Statistical 
Monographs and Courses. New York: 
Hafner Publishing Co., 1957. $4.50. 

National Bureau of Economic Research, 
National Accounts Review Committee. The 
National Economic Accounts of the United 
States. Reprinted from Hearings before the 
Subcommittee on Economic Statistics of 


the Joint Economic Committee, 85th Con- 
gress, Oct. 29 and 30, 1957. New York: Na- 
tional Bureau of Economic Research, Inc., 
1958. $.65. Paper. 

Morse, Philip M. Queues, Inventories and 
Maintenance. New York: John Wiley and 
Sons, Inc., 1958. $6.50. 

Ohkawa, Kazushi. The Growth Rate of the 
Japanese Economy. Tokyo: Kinokuniya 
Book Store Co., Ltd., 1957. 

Public Health, Department of, State of 
California. Health in California. Summary 
| a California Health Survey of 1954-56. 

eramento: California State Printing Of- 
fice, Documents Section, North Seventh 
St. and Richards Blvd. $1.50. Paper. 

Saulnier, R. J., Halcrow, Harold G., and 
Jacoby, Neil “H. Federal Lending: Its 

owth and Impact. Occasional Paper 58. 
New York: National Bureau of Economic 
Research, 1957. $1.00. Paper. 

Smart, W. M. Combination of Observa- 
tions. New York: Cambridge University 
Press, 1958. $6.50. 

Souther, James W. Technical Report 
Writing. New York: John Wiley and Sons, 
Inc., 1957. $2.95. Paper. 

Steck, G. P. Upper Confidence Limits for 
the Failure Probability of Complex Net- 
works. Albuquerque, New Mexico: Sandia 
Corporation, 1957. Available from the Of- 
fice of Technical Services, Dept. of Com- 
merce, Washington 25, D. C. $1.25. Paper. 

Strecker, Heinrich. Moderne Methoden 
in der Agrarstatistik. Wurzburg: Physica- 
Verlag, 1957. Paper. 

Taeuber, Conrad, and Taeuber, Irene B. 
The ~~ %, Population of the United 
States. New York: John Wiley and Sons, 
Inc., 1958. $7.75. 

United States ys of Agriculture. Agri- 
cultural Prices and Parity, Volume 1. Agri- 
culture Handbook No. 118. Washington, 
D. C.: Government Printing Office, 1957. 
$.50. Paper. 

United States Dept. of Commerce, 
Bureau of the Census. County and City 
Data Book, 1956. Washington, D. C.: 
Government Printing Office, 1957. $4.50. 

Wharton School of Finance and Com- 
merce, University of Pennsylvania. mp | of 
Consumer Expenditures, Incomes and Sav- 
ings. Statistical Tables, Urban U. W., 
1950. Philadelphia, Pa.: University of 
Pennsylvania, 1957. $6.00. 
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In continuation of the policy of accruing surplus whenever conditions warrant, to 
accumulate a total equal to one year’s income, the Board reports a moderate addition for 
1957 to the cumulative reserve. As is the case in many associations, rising costs may pre- 
clude further sizeable additions for some time unless the Association increases its dues as 
several others have done recently, or finds other sources of income. 


Publications 


The area of publications is causing considerable concern on the part of many inter- 
ested members. The necessity of providing a forum for the many varied disciplines using 
statistical theory and applications will require in coming years a growing expenditure for 
different types of publications. 

The Committee on Publications Policy—As part of its study of the whole problem of 
the Association’s publications, the Committee prepared a letter that was sent in August 
over President Leonard’s signature to members of the Board of Directors, the Council 
and officers of the five Sections, a total of approximately 60 persons. The letter requested 
views concerning various aspects of the problem including, for example, the matter of the 
number and kinds of articles in the Journal which are of interest to the members of the 
several sections. The letter also dealt with the Proceedings of the Business and Economic 
Statistics Section, The American Statistician, and the possibility of separate journals or 
issues of the Journal devoted to a specific area of interest. Following discussion of the 
replies, the Board and Council authorized that each Section appoint a liaison editor to the 
Journal. These have now been appointed and cooperation with them on the part of all 
members will help to make this important step successful. (The names of these liaison 
editors appear in each issue of the Journal on the editor’s page.) The complete statement 
of the Board and Council’s action appears in the October 1957 issue of The American 
Statistician. 

Following the completion of this study by the Committee, William 8. Connor asked 
to be relieved as Chairman due to the demands of other duties. Mr. Connor remains as a 
member of the Committee and Almarin Phillips, former Editor of The American Statis- 
tician, was appointed to succeed him. The Board has authorized a statistical study of the 
opinions of Journel users and the Committee is preparing the questionnaire that will be 
sent to a sample of the membership. The Association’s publications are one of its most im- 
portant activities. They merit careful re-examination at frequent intervals if the Associa- 
tion is to continue to serve the many diverse interests and fields where statistics is a tool 
of growing importance. 

1958 Membership Directory—lIn the fall, a brief questionnaire was sent to a 10% 
sample of the members, both domestic and foreign, requesting opinions on the inclusion 
of various items in the next edition of the Directory. As a result of the replies received, 
certain new items of information about cach member will be added to the alphabetical 
section of the book. The last edition of the Directory was published in 1954. 

Proceedings of the Business and Economic Statistics Section—The publication of a 
1957 volume of proceedings was authorized by the Board prior to the Annual Meeting last 
September. This edition which contains many papers presented in sessions of the Business 
and Economic Statistics Section at Atlantic City, will be available early in 1958. The 
1955-56 Proceedings, which appeared early in 1957, have now been sold. There were ap- 
proximately 1,200 copies printed and, consequently, almost all costs were recovered. 


Grants 


A grant of $9,000 to cover a three-year period was received in the spring of 1957 from 
the Carnegie Corporation. Its purpose is to pay travel expenses to international meetings 
for official delegates from the Association. A Committee was authorized by the Board to 
set forth the stipulations governing individual applications for travel assistance. The 
complete details concerning applications for the meeting of the International Statistical 
Institute in Brussels in 1958 are printed in the December 1957 issue of The American 
Statistician. Six other societies were each granted a like amount at the same time as ASA 
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and the Social Science Research Council received $150,000 for the se me purpose. Appropri- 
ate liaison is being worked out with other societies, with the Council and with the National 
Science Foundation. 

As reported previously, a grant of $19,640 was received from the Ford Foundation in 
1956 for the preparation of an index to the Journal of the American Statistical Association 
to cover Volumes 35 through 50. The work is under the immediate supervision of Boyd 
Harshbarger at Virginia Polytechnic Institute. An Advisory Committee of the Association 
chaired by William G. Cochran assists Mr. Harshbarger and his associates. 

Other Activities 

New Twin Cities (Minn.) Chapter—A charter has been granted to a new Chapter of 
the Association in the Twin Cities area of Minnesota. The number of chapters is now 36. 

Weiss Memorial Fund—The fund administered by the University of Pennsylvania in 
memory of Samuel Weiss, has reached a total of slightly over $2,500. The money will be 
used for student aid on the graduate level. 

Reduction of Dues for Cuba and Mexico—Beginning in 1958, the dues for members in 
Cuba and Mexico have been reduced to $5.00 per year. This brings the payment into line 
with other countries. 


Future Annual Meetings 
The Annual Meetings set to date are as follows: 
1958 Chicago, Ill. Congress Hotel December 27-30 Joint with Economic and 
Allied Associations 
1959 Wash., D. C. Shoreham Hotel December 27-30 Joint with Economic and 
Allied Associations 
1960 Cal*‘ornia Palo Alto Late August 
(Stanford) 
1961 New York City Roosevelt Hotel December 27-30 Joint with Economic and 
Allied Associations 
The Atlantic City meeting in September 1957 was very well attended. More than 
1,130 persons registered for the sessions and the hotel had to arrange for quarters nearby 
for many who were too late for reservations. The meeting was noted also for the largest 
number of individual sessions ever presented at an Annual Meeting of the Association. 
The Council of the Association, enlarged in 1956, is helping measurably to improve 
member participation in the Association’s activities. During the year, officers traveling 
in various parts of the country participated in Chapter meetings and plans for greater 
activity along these lines are being made for the forthcoming year. 


REPORT OF THE SECRETARY-TREASURER, 1957 


The surplus for 1957 amounts to $7,700. Added to the cumulative surplus to 
date, the Association starts 1958 with approximately $45,500 in reserve funds. The 1957 
surplus results more from a reduction in expenditures than increases in income. This is 
particularly true in the printing budget of the Journa!, though some other expense items 
are smaller as well. Some income items show an increase over budget, the most significant 
being advertising and Journal sales. 

Specifically, the 1957 budget was set at $71,925.00 for income and $71,845.00 for 
expense. Actual income is about $75,400, while expense for the year totals approximately 
$67,600. Note that the membership directory actual expense allocated for 1957 has been 
raised to $6,000 instead of $3,000 as budgeted. This is the expense for the 1958 directory 
on which preparation has recently begun. (Total Directory cost is expected to be $9,000.) 
Although it has been the usual policy to spread directory expense over a three-year period, 
a larger share has been allocated to 1957 instead of placing this $3,000 into surplus. There- 
fore, the final expense for the directory will be taken in the 1958 budget leaving 1959 free 
for other considerations. (The Journal Index to Volumes 35-50 may be ready for printing 
in 1959.) 

For a number of years, it has been reported in the annual audit that dues income 
exceeded the amount budgeted. The situation for 1957 is reversed—actual dues income 
fell short of budget. It is significant that for the first time in ten years the number of 
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members actually decreased. There are several possible explanations for this: the fact that 
ASA has a larger “fringe” membership than other societies, with these marginal-interest 
members dropping out for economic or other seasons; the lack of sustained membership 
promotion drives in 1957 (a condition that has already been corrected for 1958) ; the problem 
concerning publications which has deen pointed up in the report of the Board of Directors. 

The actual count of members at the beginning of 1957 was 5,691. During 1957, 530 
new members joined the Association and 32 reinstated their membership after a lapse of 
one or more years. However, 136 persons resigned or died and 450 were dropped for non- 
payment of dues. This last figure is considerably larger than preceding years when it has 
generally approximated 300. Thus, the Association starts 1958 with 5,667 members. 

As has been mentioned above, promotion campaigns in 1958 will be actively pursued. 
The results of the first invitational mailings have already been felt—the number of new 
members joining during approximately the last month of 1957 and credited to 1958 is 
more than twice the number joining in the comparable period a year ago. (Subsequently, 
the upward membership trend has again accelerated and membership now totals over 
6,000.) Journal subscriptions (to libraries, business firms, etc.) continued to increase. The 
number for this year is 1,784 as compared to 1,657 in 1956. Campaigns for new Journal 
subscribers (as distinct from membership) have been started as well. 

Income for 1958 has been budgeted at 74,970, with expenses at the same figure. In 
line with previous policy discussions in the Council and Board, a sum of $2,620 is being 
set aside from reserve to meet contingent Journal expense. This leaves the Journal with 
the budgetary latitude that is felt to be desirable. When additional suitable articles are 
available, the funds for an increased number of pages would be available also. Should the 
Journal printing expense as well as other expense items be substantially under the 1958 
budget, it is possible that we shall end the forthcoming year with another small addition 
to surplus. 

One other point should be noted concerning the 1958 expense budget. Due to the 
addition of another part-time person to the national office staff and, to a lesser extent, 
raises in salary, the expense item for salaries has increased. The growth in size and activi- 
ties of ASA over the past five years alone has necessitated hiring extra help. The full-time 
and part-time staff thus is once more at about the same size as that in 1950 when member- 
ship numbered approximately 4,200. Salaries paid are in keeping with the wages paid for 
similar services elsewhere in Washington. 

Every effort will be made in 1958 to increase income while practicing stringent econo- 
mies on expenses. The reserve fund must be increased in anticipation of future contingencies. 


December, 1957 Donaup C. RiLEy 
Secretary- Treasurer 


JAMES G. JESTER 
CrertiFieD PusLic ACCOUNTANT 
Tower BuiLpine 
Wasuinorton 5, D. C. 
April 22, 1958 
To the Board of Directors of the 

American Statistical Association: 

I have examined the financial statements of the American Statistical Association relat- 
ing to the year ended December 31, 1957. My examination was made in accordance with 
generally accepted auditing standards and, accordingly, included such tests of the ac- 
counting records and other auditing procedures as were considered necessary in the cir- 
cumstances, except that I did not check the membership and subscription records in detail, 
or make any independent verification of the inventory of back numbers of the Journals 
the office records of which are based, in part, on data assembled in prior years. 

In my opinion, the accompanying statements present fairly the financial position of 
the American Statistical Association on December 31, 1957, and the results of its opera- 
tions for the year then ended, in accordance with generally accepted accounting principles 
applied on a basis consistent with that of the preceding year. 

James G. JEsTErR 
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Exhibit A 
AMERICAN Statistical ASSOCIATION 
CoMPARATIVE BALANCB SHEET 
Assets 
December 31, 
1957 1956 
Cag Sa Danes ad O68 bee)... bike cic ste cede as $ 30,064.70 $20,133.63 
ONG GOs i oies ak 0 a cet nc bina pe Chediatinny ses 2,438.81 912.84 
Investments: 
United States Savings bond, Series G, due 1962, at cost.. 3,100.00 3,100.00 
United States Treasury 2}% bonds, $7,000 face amount, 
due Newediber 1061, at Ce06... occ be cece csieeess 9,566.88 6,700.94 
Federal Home Loan Bank Board...................... 5,000.00 
Federal National Mortgage Association................ 5,000.00 
Funds on deposit with savings and loan institutions— 
an Sob 5000s £0 PERS Sc d% 5s < + AS OM Manis Ge 54,392.00 56,909.09 
Inventories: 
Old Journals, at approximate cost..................... 3,113.49 2,922.00 
Monograph on Kinsey Report, at cost................. 2,746.61 2,848.87 
NSN ns. cans gars satiale HV cade 6 oye s «he 136.50 217.50 
Furniture and fixtures, at cost, less accumulated depreciation 2,234.08 1,660.42 
Deferred charges: 
PS IIE, s;). <5 2 so bp es ds Cn VUE ocd 4 ba 400 .33 527.35 
Re EE AEIEIND...«. 5 i su. sw gc bg hase cease 189.61 382 .00 
Sey SU da 440d, Cans sons toa ban ee COCR SE OES 200 .00 
Prepaid insurance and fidelity bond................... 604.00 202 .02 
I oss, ss sedan + Si ccn se dnay sete 1,181.32 
SN or. x a aa tain Uueiiers ken ene Ci anaes ie ee wha ae $120,168.33 $96,716.66 





GING WII. 6 nook a 6 0 Ss Sdteeaes apes nse rete. $ 11,304.44 $ 5,172.65 
Deferred income (collections applicable to 
subsequent years): 
ae en ic, ¢'e haba, «veo 4 vee PRET Che com $ 26,339.00 $20,740.00 


ES i ue UL. oan ape ek Hae eee REESE 7,792.40 7,073.40 
ae UD NE MN ok, oe vce a ow he tween ee ae 13,037.10 $19,640.00 
Carnegie Institution Travel Fund Grant............... 8,931.72 

SOE a.0'e ccs Wied as CRD < colle <2 bah Bieg WSR A hawe woud 466 . 84 2,467 .33 





$ 56,567.06 $49,925.73 





Deferred credit: 
Appropriation in 1957 toward publication costs of 





the 1958 Membership Directory.................... $ 3,000.00 
Net Worth: 
ROR TONING aici ins 5 iw i wiinwes sh ncmoes $ 3,743.95 $ 3,843.47 
SI NS ahaa a Bic aka wdc agaraw Care dian 45,552.88 37,774.81 
I ag cae, wesc, bis A Rode be otk win ae $ 49,296.83 $41,618.28 





Total Liabilities and Net Worth........................ $120,168.33 $96,716.66 
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Exhibit B 
AMERICAN STATISTICAL ASSOCIATION 
CoMPARATIVE STATEMENT OF INCOME AND SurRPLUS 
Year ended December 31, 














1957 1956 
Income: 
Dues—Old members, including Life members............ $41,523.40 $40,432.26 
eI i odie ta nds Qnew ede neeetee ; 3,873.75 4,683 .00 
ans nn sw v'c mea as &eeean 13,336.73 12,441.53 
—American Statistician................ 541.42 504.45 
EE CIE Fry a Lsepres 2,903 .55 1,661.08 
—American Statistician. . . PERE ee 918.37 699.76 
Diep Serre. i oe A 2,531.87 1,308.82 
<n GUNN RUINS 0's Scans cscs cevenseteecesns 197 .97 176.45 
—Kinsey Report Monograph.................... 231.66 602.00 
—Business and Economic Section Proceedings...... . 3,018.70 456.69 
—Emblems, less cost of sales....................-.. 47.85 62.67 
ES diva tees 5s bi 4:2 ee AA eA OH ly a 127.25 57.58 
Mailing list income................... oN canes i 1,418.45 1,473.91 
EE cL IEE cock cb es.cuusamuad - 2,593.68 1,631.69 
Annual meeting, including Exhibits..................... 2,137.88 2,231.84 
MUNI ees aciawdidd dennis, ... epee + 19.86 99 .58 
,  |t”:tsC ee Bo le. ae $75,422.39 $68,523.31 
Expenses: 
a a a a ae $17,578.43 $15,418.62 
Pension Plan expense............... Dy EN aR ees oats Oe 393.78 
Publications—Schedule I.................. 0000 ce eeeues 33,973.42 26,991.40 
eee fee ee ee oe ee eee 1,650.53 45.00 
Sahl ainage epee tlahcsiep, emlndlarornenge: cy "he ticalivete aie: 2,400.00 2,400.00 
Travel and Secretary’s expense... ...............00005. 1,142.43 732.55 
Supplies and office expenses....................... are 2,724.69 2,414.66 
IN. «0.5 ati te ieriaiaes anes bie dh oeee SE Geena ae eins « 1,973.03 2,071.07 
pe Ee es ee rie eee Bea 810.86 887.45 
SE TRS St ie id Pehle Coan Pesiaenes 970.00 970.00 
ey ee eet at wae t tates os 4 612.51 741.91 
Se I inns sos she cmbes peseecys 1,856.98 1,095.66 
Miscellaneous expenses—Schedule I.................... 1,557 .66 1,790.00 
$67,644.32 $55,558.32 
Excess of income over expense for the year................ $ 7,778.07 $12,964.99 
Add: Surplus account at beginning of year................ 37,774.81 24,809.82 





Surplus account at end of year—Exhibit A................ $45,552.88 $37,774.81 
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Schedule I 
AMERICAN SraTisTICAL ASSOCIATION 
Year ended December 31, 
1957 1956 
Publications: 

Samrat Peat iscind so 3 is aga Fe on + oe $16,191.05 $15,078.02 
GOI. 54. «s o.0 vie » cuss tpenees 14s eee 750.00 750.00 
—Editorial expense......................-..... 2,298 .67 2,493 .26 

$19,239.72 $18,321.28 

PN ee ay PS. AA 8,382.60 6,652.67 

Monographs—Kinsey report................-..-..-+-.- 102.26 190.17 

Business and Economic Section Proceedings............. 3,150.24 40.18 

Membership Directory (includes $3,00G budgeted in 1957 to- 

ward publication costs of the 1958 Membership Directory) 3,098 .60 1,787.10 

Total publications—Exhibit B......................... $33,973.42 $26,991.40 

Miscellaneous Expenses: 

BROORMa ict s 6:0'h.i-> ced 6es taken taht Lee $ 371.08 $ 308.71 

Dues to other organizations.....................-...... 100.00 123.25 

i oa nae bad ek Oem bes 050 «cba SAE re 477 .86 413.38 

Repairs and maintenance............................. 169.35 115.71 

I ata, os day udnd 4 dem bsed ese bes oae Kae ee 253 .97 251.91 

sd aS ba 6 dae Se wdc Ske Cis Res eck tone COs ae 185.40 577 .04 

Total miscellaneous expenses—Exhibit B................ $ 1,557.66 $ 1,790.00 





Funds on deposit with savings and loan institutions: 


American Savings and Loan Association................ $ 5,580.84 $ 5,390.53 
City Federal Savings & Loan Association............... 10,000.00 10,000.00 
Fidelity Federal Savings & Loan Association............ 9,482.44 9,113.33 
Guardian Savings & Loan Association.................. 5,000.00 5,015.63 
Hyattsville Building Association....................... 1,132.26 1,092.57 
Interstate Building Association........................ 8,140.48 
Liberty Savings & Loan Association.................... 3,714.96 3,605.70 
Marshall Savings & Loan Association................... 5,000 .00 

National Permanent Building Association............... 5,437.53 
Piedmont Federal Savings & Loan Association........... 9,481.50 9,113.32 
Trans-Bay Federal Savings & Loan Association.......... 5,000.00 





Total funds on deposit with savings and loan institutions.. $54,392.00 $56,909.09 








IBM STATISTICIANS 


Advantages of IBM 
Excellent opportunity for statisticians A recognized leadez in the electronic 
with background in engineering field. Omputer field . . . stable balance of 
MS. in Statistics, with major work in pee 00 melts eee 
math statistics. Minimum of two years’ _— relocation expenses .. . li one 
einai in statiotical k, prefer- benefits . . . salary commensurate wi 
ably with engineering applications. , i s 
Thorough knowledge of statistical dis- Owege and Endicott, N.Y. 
tribution functions. To handle anlysis- WRITE, outlining qualifications and experi- 
of-variance and multiple-regression type  °** *®: 





problems; to design experiments for Technical Recrvitment Mgr. 
various engineering applications; to se- IBM Corp., 599 Madison Avenve 
lect the form of statistical analysis of saw. ees SR, ame Tos 
greatest value; to give statistical support cuactase Twrnen Teas 

to engineering departments in such sreciag Uaee PRODUCTS 

areas as flight test, reliability analysis, oe 





and human factors engineering; to de- 
velop statistical programs for the IBM 
704. 


INTERNATIONAL 
BUSINESS MACHINES 
CORPORATION 

















Ph.D. MATHEMATICIANS 












Our Analysis Department has uke 5 for Ph.D. Mathematicians 
who have a lasting interest in some fiel athematics such as Algebra 
Geometry, Measure , Topo , Differential Equations, Numerica 
Anse, Statistics, or Pro’ ability Association will be with others 

similar educational background engaged in the study of challenging 
pon Bans vital to the nation's welfare. 


The work is of an independent research nature with supervision at a mini- 
mum and by people of similar interests who are aware of the needs of 
research personnel, 









Sandia Sepeeien, engaged in research and development 
of nuclear wea is located in Albuquerque, N.M.—a 
——— city of of 208,000 famous for its splendid climate 
recreational and cultural attractions. 
Excellent working conditions —! liberal vege a —— 
including rous vacations and holidays; retirement, hos- 
pital, and insurance plans. 













Please send résumé to 
Staff E: 
Division 501 


Please mention the Journal of the American Sraristicat AssociaTION in writing advertisers 























BIOMETRIKA 


Vol. 45, Parts 1 & 2 June, 1958 


CONTENTS 
Memoirs: 
ARMITAGE, P. Numerical studies in the sequential estimation of a binomial parameter. 
LESLIE, P. H. A stochastic model for studying the properties of certain biological 
systems by numerical methods. 


CERAM, J. G. On the derivation and applicability of Neyman’s Type A distribu- 
on. 


BLISS, C. I. & OWEN, A. R. G. Negative binomial distributions with a common k. 
BRASS, W. Simplified methods of fitting the truncated negative binomial distribution. 
COX, D. R. The interpretation of the effects of non-additivity in the latin square. 


ASHFORD, J. R. ntal to mixtures of poisons under conditions of simple 
similar cue SE eauipale 08 encentrelied data. 


MOORE, P. G. Some properties of runs in quality control procedures. 
WILLIAMS, E. J. Simultaneous regression equations in experimentation. 
RUSSELL, T. 8S. & BRADLEY, R. A. One-way variances in a two-way classification. 


PLACKETT, R. L. Studies in the history of probabili: d statistics. VII. The prin- 
ciple of the arithmetic mean. ung . ro 


BROWN, R. L. & FEREDAY, F. Multivariate linear structural relations. 
OLKIN, I. Multivariate ratio estimation for finite populations. 


BARTON, D. E., DAVID, F. N. & MALLOWS, C. L. Non-randomness in 8 sequence 
of two alternatives. I. Wilcoxon’s and allied test statistics. 


GOODMAN, L. A. Simplified runs tests and likelihood ratio tests for Markoff chains. 
LEIPNIK, R. B. Moment generating functions of quadratic forms in serially corre!ated 
normal variables. 


SAW, J. G. Moments of sample moments of censored samples from a normal population. 

HERDAN, G. The relation between the dictionary distribution and the occurrence 
distribution of word length and its importance for the study of quantitative 
linguistics. 

MacKENZIE, J. K. Second paper on statistics associated with the random disorienta- 
tion of cubes. 

WAUGH, W.A.O’N. Conditioned Markov processes. 


Miscellanea: Contributions by D. E. BARTON, D. E. BARTON & F. N. DAVID, 
H. BERGSTROM, C. W. CLUNIES-ROSS, H. R. B. HACK, M. HALPERIN & 
8. W. GREENHOUSE, G. HERDAN, A. HUITSON, N. L. JOHNSON, J. E. 
KERRICH, R. D. LORD, R. MAURICE. 


Oorrigenda: F. N. DAVID & M. G. KENDALL: A. J. MAYNE: E. L. CROW. 
Reviews 
Other Books Received 


The subscription, payable in advance, is 54/- (or $8.00) per volume (Including post- 
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throughout and statistical jargon is avoided. 
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Statistical Yearbook 1957 


A comprehensive collection of international statistics relating to economic, demo- 

graphic, financial, and social subjects. There are 191 tables, most of which cover a 
twenty-year period ending in 1956 or mid-1957. This volume oe ppd no 
new chapter on international economic aid, total assistance related to population and 
per capita national product, and the contributions of governments to inter- 
national ical assistance and relief ies. A feature of this issue is the large 
number of statistics it contains for the USSR and East European countries, Other 
new tables relate to ocean freight rates, discount rates of central banks, and number 
pal yer omg of books by subject groups and original languages. A United Nations 
Pp n. 


Sales No. 1957. XVII.1: cloth $8.00, paper $6.50 


Demographic Yearbook 1957 


This issue features mortality statistics in a historical series that analyzes and links 
the rates of death to age, sex, and cause, back through the 1930s. « compendium of 
official demographic statistics, it contains information on time trends of population 
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over sovereign countries, non-self-governing territories, dependencies 
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decline in mortality ut the world will supplement the 

“Technical Notes” that describe the limitations in comparability of the statistics. 
A United Nations publication. 


Sales No. 1957. XIII.1: cloth $3.00, paper $6.50 


Yearbook of International 
Trade Statisties 1956 


Volume I discusses total imports and exports (in national currency and U.S. dollars), 

trade i in gold, indices of quantum and unit value, currency conversion factors, trade 
in principal commodities (where possible according to the Standard International 

Trade Classification), and trade by principal countries of provenance and destina- 

tion. For each of the 115 countries regional summaries with id totals survey value 
of imports and exports (in U.S. dollars), provenance and tion of world ex- 
ports (in matrix form), indices of quantum, and unit value E exports. Volume II 
contains information on provenance and destination of trade in each of the follow- 
ing commodity classes (values in U.S. dollars): food, beverages, tobacco, raw ma- 
terials, fats and oils, fuels, chemicals, meminery, transport equipment, and other 
manufactured goods, A United Nations 


Sales No. 1957. XVII.6 Vol. I, Vol. II: Vol. I $7.00, Vol. II $1.50 


Please order these books by sales number. 
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ELEMENTARY STATISTICAL METHODS as Applied 
to Business and Economic Data, Revised Edition 


by WILLIAM A. NEISWANGER, University of Illinois 
from The Journal of the American Statistical Association: 
“Well written and widely adopted, the Neiswanger text now 
appears in an improved and modernized edition. This fine 
text has been generally known and highly regarded since its 
publication in 1943; the revision is both a comprehensive 
and meticulous job of bringing the volume up to date and 
perfecting the presentation. . . . The new edition is a thought- 
ful and careful revision in which the style, content, and 
emphasis of the most readable text in the field have been 
improved and brought up to date. . . . This review is ex- 
tremely favorable and in a sense the major criticism is also 
a compliment. In many ways the book is too good and too 
complete! It makes vigorous competition for the teacher; 
any teacher compares favorably with a poor text but only the 
best will outshine Neiswanger. 


“In brief it is a pleasure to report that this revision em- 
bodies many improvements over an originally very good 
text.” Lloyd Saville, Duke University 


1956 749 pages $7.25 


PROBABILITY, STATISTICS AND TRUTH 


by RICHARD VON MISES 
Second Revised En,lish Edition by HILDA GEIRINGER 


A translation of one of the internationally outstanding works 
on statistics, this is an analysis of the quantitative concept of 
probability in probability theory. 

1957 244 pages $5.00 


She Macmillan 6. 


60 FIFTH AVENUE, NEW YORK 11, N. Y. 
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Kenneth J. Arrow, Samuel Karlin, and Herbert Scarf 


This book is the first to explore and formulate the general mathematical 
concepts behind efficient inventory and production control. In a series 
of interrelated papers, inventory polices are studied under a wide variety 
of assumptions likely to be met in practice. The mathematics used repre- 
sent new develepments in the calculus of variations and in the theory 
of stochastic processes. Stanford Mathematical Studies in the Social 
Sciences. I. 350 pages. 67% x 10 inches. $8.75 


Stanford University Press, Stanford, California 
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NEW SECOND EDITION! NEW SECOND EDITION! 


MONEY, MEN AND MACHINES 


By Waddill Catchings and Charles F, Roos 


The internationally known authors of the new second edition of 
MONEY, MEN AND MACHINES take a firm stand against the 
erratic course of the Federal Reserve Board. 

The authors’ criticism is heavily documented and they advocate a 
sound monetary policy to provide for a stabilized, growing econ- 
omy. 


The bovok discusses the serious lem whether Congress had really 
intended to bestow upon the the economic powers which it 
now has. 

In this new edition, three important chapters were added. Vital 
negenls Saege OF Ge <OED One: Ge wane > Geane ale NOTES 


261 pages, price $3.50 
Published by 


THE ECONOMETRIC INSTITUTE, INC. 
230 Park Avenve MUrray Hill 4-7800 New York 17, N. Y. 
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The Computing Laboratory 
in the University 


Edited by Preston C. Hammer 


Thirty-one experts from the 1955 conference on the 
computing laboratory in the university discuss the 
place of the university in the computing field, as 
well as particular applications of computers in 
science and industry, the problems of training per- 
sonnel for computing, and ways to establish com- 
puting laboratories in universities. 

256 pages $6.50 


The University of Wisconsin Press 


430 STERLING COURT °* MADISON 6, WISCONSIN 
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